Dataset statistics
| Number of variables | 45 |
|---|---|
| Number of observations | 22083 |
| Missing cells | 94691 |
| Missing cells (%) | 9.5% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 41.0 MiB |
| Average record size in memory | 1.9 KiB |
Variable types
| CAT | 21 |
|---|---|
| BOOL | 18 |
| NUM | 6 |
Patient_First_Name has a high cardinality: 2524 distinct values | High cardinality |
Family_Name has a high cardinality: 6282 distinct values | High cardinality |
Fathers_name has a high cardinality: 16368 distinct values | High cardinality |
Autopsy_shows_birth_defect_(if_applicable) is highly correlated with Status | High correlation |
Status is highly correlated with Autopsy_shows_birth_defect_(if_applicable) | High correlation |
Place_of_birth is highly correlated with Institute_Name and 1 other fields | High correlation |
Institute_Name is highly correlated with Place_of_birth | High correlation |
Location_of_Institute is highly correlated with Place_of_birth | High correlation |
Disorder_Subclass is highly correlated with Genetic_Disorder | High correlation |
Genetic_Disorder is highly correlated with Disorder_Subclass | High correlation |
Patient_Age has 1427 (6.5%) missing values | Missing |
Inherited_from_father has 306 (1.4%) missing values | Missing |
Maternal_gene has 2810 (12.7%) missing values | Missing |
Family_Name has 9691 (43.9%) missing values | Missing |
Mothers_age has 6036 (27.3%) missing values | Missing |
Fathers_age has 5986 (27.1%) missing values | Missing |
Institute_Name has 5106 (23.1%) missing values | Missing |
Respiratory_Rate_(breaths/min) has 2149 (9.7%) missing values | Missing |
Heart_Rate_(rates/min has 2113 (9.6%) missing values | Missing |
Test_1 has 2127 (9.6%) missing values | Missing |
Test_2 has 2152 (9.7%) missing values | Missing |
Test_3 has 2147 (9.7%) missing values | Missing |
Test_4 has 2140 (9.7%) missing values | Missing |
Test_5 has 2170 (9.8%) missing values | Missing |
Parental_consent has 2125 (9.6%) missing values | Missing |
Follow-up has 2166 (9.8%) missing values | Missing |
Gender has 2173 (9.8%) missing values | Missing |
Birth_asphyxia has 2139 (9.7%) missing values | Missing |
Autopsy_shows_birth_defect_(if_applicable) has 1026 (4.6%) missing values | Missing |
Place_of_birth has 2124 (9.6%) missing values | Missing |
Folic_acid_details_(peri-conceptional) has 2117 (9.6%) missing values | Missing |
H/O_serious_maternal_illness has 2152 (9.7%) missing values | Missing |
H/O_radiation_exposure_(x-ray) has 2153 (9.7%) missing values | Missing |
H/O_substance_abuse has 2195 (9.9%) missing values | Missing |
Assisted_conception_IVF/ART has 2122 (9.6%) missing values | Missing |
History_of_anomalies_in_previous_pregnancies has 2172 (9.8%) missing values | Missing |
No._of_previous_abortion has 2162 (9.8%) missing values | Missing |
Birth_defects has 2154 (9.8%) missing values | Missing |
White_Blood_cell_count_(thousand_per_microliter) has 2148 (9.7%) missing values | Missing |
Blood_test_result has 2145 (9.7%) missing values | Missing |
Symptom_1 has 2155 (9.8%) missing values | Missing |
Symptom_2 has 2222 (10.1%) missing values | Missing |
Symptom_3 has 2101 (9.5%) missing values | Missing |
Symptom_4 has 2113 (9.6%) missing values | Missing |
Symptom_5 has 2153 (9.7%) missing values | Missing |
Genetic_Disorder has 2146 (9.7%) missing values | Missing |
Disorder_Subclass has 2168 (9.8%) missing values | Missing |
Fathers_name is uniformly distributed | Uniform |
Patient_Id has unique values | Unique |
Blood_cell_count_(mcL) has unique values | Unique |
Patient_Age has 1386 (6.3%) zeros | Zeros |
No._of_previous_abortion has 3964 (18.0%) zeros | Zeros |
Reproduction
| Analysis started | 2021-11-04 21:44:26.453481 |
|---|---|
| Analysis finished | 2021-11-04 21:45:20.487097 |
| Duration | 54.03 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
| Distinct | 22083 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 172.6 KiB |
| PID0x6038 | 1 |
|---|---|
| PID0x68a9 | 1 |
| PID0x2825 | 1 |
| PID0x9921 | 1 |
| PID0x91e | 1 |
| Other values (22078) |
| Value | Count | Frequency (%) | |
| PID0x6038 | 1 | < 0.1% | |
| PID0x68a9 | 1 | < 0.1% | |
| PID0x2825 | 1 | < 0.1% | |
| PID0x9921 | 1 | < 0.1% | |
| PID0x91e | 1 | < 0.1% | |
| PID0x3d09 | 1 | < 0.1% | |
| PID0x859e | 1 | < 0.1% | |
| PID0x1868 | 1 | < 0.1% | |
| PID0x2283 | 1 | < 0.1% | |
| PID0x801 | 1 | < 0.1% | |
| PID0x5e4e | 1 | < 0.1% | |
| PID0x7b99 | 1 | < 0.1% | |
| PID0x70aa | 1 | < 0.1% | |
| PID0x125b | 1 | < 0.1% | |
| PID0x4e4f | 1 | < 0.1% | |
| PID0x281c | 1 | < 0.1% | |
| PID0x434c | 1 | < 0.1% | |
| PID0x6949 | 1 | < 0.1% | |
| PID0x2dcf | 1 | < 0.1% | |
| PID0xe1b | 1 | < 0.1% | |
| PID0x6b5f | 1 | < 0.1% | |
| PID0x250e | 1 | < 0.1% | |
| PID0x4544 | 1 | < 0.1% | |
| PID0x64ed | 1 | < 0.1% | |
| PID0x3768 | 1 | < 0.1% | |
| Other values (22058) | 22058 | 99.9% |
Frequencies of value counts
Unique
| Unique | 22083 ? |
|---|---|
| Unique (%) | 100.0% |
Histogram of lengths of the category
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 8.894579541 |
| Min length | 6 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 0 | 26158 | 13.3% | |
| P | 22083 | 11.2% | |
| I | 22083 | 11.2% | |
| D | 22083 | 11.2% | |
| x | 22083 | 11.2% | |
| 6 | 6504 | 3.3% | |
| 7 | 6486 | 3.3% | |
| 8 | 6469 | 3.3% | |
| 5 | 6443 | 3.3% | |
| 4 | 6439 | 3.3% | |
| 3 | 6427 | 3.3% | |
| 1 | 6415 | 3.3% | |
| 2 | 6405 | 3.3% | |
| 9 | 5917 | 3.0% | |
| a | 4221 | 2.1% | |
| b | 4106 | 2.1% | |
| f | 4058 | 2.1% | |
| d | 4033 | 2.1% | |
| e | 4009 | 2.0% | |
| c | 3997 | 2.0% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 83663 | 42.6% | |
| Uppercase Letter | 66249 | 33.7% | |
| Lowercase Letter | 46507 | 23.7% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| P | 22083 | 33.3% | |
| I | 22083 | 33.3% | |
| D | 22083 | 33.3% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 0 | 26158 | 31.3% | |
| 6 | 6504 | 7.8% | |
| 7 | 6486 | 7.8% | |
| 8 | 6469 | 7.7% | |
| 5 | 6443 | 7.7% | |
| 4 | 6439 | 7.7% | |
| 3 | 6427 | 7.7% | |
| 1 | 6415 | 7.7% | |
| 2 | 6405 | 7.7% | |
| 9 | 5917 | 7.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| x | 22083 | 47.5% | |
| a | 4221 | 9.1% | |
| b | 4106 | 8.8% | |
| f | 4058 | 8.7% | |
| d | 4033 | 8.7% | |
| e | 4009 | 8.6% | |
| c | 3997 | 8.6% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 112756 | 57.4% | |
| Common | 83663 | 42.6% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| P | 22083 | 19.6% | |
| I | 22083 | 19.6% | |
| D | 22083 | 19.6% | |
| x | 22083 | 19.6% | |
| a | 4221 | 3.7% | |
| b | 4106 | 3.6% | |
| f | 4058 | 3.6% | |
| d | 4033 | 3.6% | |
| e | 4009 | 3.6% | |
| c | 3997 | 3.5% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 0 | 26158 | 31.3% | |
| 6 | 6504 | 7.8% | |
| 7 | 6486 | 7.8% | |
| 8 | 6469 | 7.7% | |
| 5 | 6443 | 7.7% | |
| 4 | 6439 | 7.7% | |
| 3 | 6427 | 7.7% | |
| 1 | 6415 | 7.7% | |
| 2 | 6405 | 7.7% | |
| 9 | 5917 | 7.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 196419 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 0 | 26158 | 13.3% | |
| P | 22083 | 11.2% | |
| I | 22083 | 11.2% | |
| D | 22083 | 11.2% | |
| x | 22083 | 11.2% | |
| 6 | 6504 | 3.3% | |
| 7 | 6486 | 3.3% | |
| 8 | 6469 | 3.3% | |
| 5 | 6443 | 3.3% | |
| 4 | 6439 | 3.3% | |
| 3 | 6427 | 3.3% | |
| 1 | 6415 | 3.3% | |
| 2 | 6405 | 3.3% | |
| 9 | 5917 | 3.0% | |
| a | 4221 | 2.1% | |
| b | 4106 | 2.1% | |
| f | 4058 | 2.1% | |
| d | 4033 | 2.1% | |
| e | 4009 | 2.0% | |
| c | 3997 | 2.0% |
| Distinct | 15 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 1427 |
| Missing (%) | 6.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.974147947 |
|---|---|
| Minimum | 0 |
| Maximum | 14 |
| Zeros | 1386 |
| Zeros (%) | 6.3% |
| Memory size | 172.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 3 |
| median | 7 |
| Q3 | 11 |
| 95-th percentile | 14 |
| Maximum | 14 |
| Range | 14 |
| Interquartile range (IQR) | 8 |
Descriptive statistics
| Standard deviation | 4.319475047 |
|---|---|
| Coefficient of variation (CV) | 0.6193552359 |
| Kurtosis | -1.215597674 |
| Mean | 6.974147947 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 0.00950741439 |
| Sum | 144058 |
| Variance | 18.65786468 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=15)
| Value | Count | Frequency (%) | |
| 4 | 1435 | 6.5% | |
| 12 | 1435 | 6.5% | |
| 9 | 1415 | 6.4% | |
| 2 | 1396 | 6.3% | |
| 5 | 1394 | 6.3% | |
| 0 | 1386 | 6.3% | |
| 13 | 1384 | 6.3% | |
| 3 | 1383 | 6.3% | |
| 6 | 1374 | 6.2% | |
| 1 | 1364 | 6.2% | |
| 11 | 1353 | 6.1% | |
| 7 | 1351 | 6.1% | |
| 8 | 1340 | 6.1% | |
| 14 | 1333 | 6.0% | |
| 10 | 1313 | 5.9% | |
| (Missing) | 1427 | 6.5% |
| Value | Count | Frequency (%) | |
| 0 | 1386 | 6.3% | |
| 1 | 1364 | 6.2% | |
| 2 | 1396 | 6.3% | |
| 3 | 1383 | 6.3% | |
| 4 | 1435 | 6.5% | |
| 5 | 1394 | 6.3% | |
| 6 | 1374 | 6.2% | |
| 7 | 1351 | 6.1% | |
| 8 | 1340 | 6.1% | |
| 9 | 1415 | 6.4% |
| Value | Count | Frequency (%) | |
| 14 | 1333 | 6.0% | |
| 13 | 1384 | 6.3% | |
| 12 | 1435 | 6.5% | |
| 11 | 1353 | 6.1% | |
| 10 | 1313 | 5.9% | |
| 9 | 1415 | 6.4% | |
| 8 | 1340 | 6.1% | |
| 7 | 1351 | 6.1% | |
| 6 | 1374 | 6.2% | |
| 5 | 1394 | 6.3% |
Genes_in_mothers_side
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 172.6 KiB |
| Yes | |
|---|---|
| No |
| Value | Count | Frequency (%) | |
| Yes | 13143 | 59.5% | |
| No | 8940 | 40.5% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 306 |
| Missing (%) | 1.4% |
| Memory size | 172.6 KiB |
| No | |
|---|---|
| Yes | |
| (Missing) | 306 |
| Value | Count | Frequency (%) | |
| No | 13133 | 59.5% | |
| Yes | 8644 | 39.1% | |
| (Missing) | 306 | 1.4% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2810 |
| Missing (%) | 12.7% |
| Memory size | 172.6 KiB |
| Yes | |
|---|---|
| No | |
| (Missing) |
| Value | Count | Frequency (%) | |
| Yes | 10647 | 48.2% | |
| No | 8626 | 39.1% | |
| (Missing) | 2810 | 12.7% |
Paternal_gene
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 172.6 KiB |
| No | |
|---|---|
| Yes |
| Value | Count | Frequency (%) | |
| No | 12508 | 56.6% | |
| Yes | 9575 | 43.4% |
| Distinct | 22083 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.898871078 |
|---|---|
| Minimum | 4.092727034 |
| Maximum | 5.60982897 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 172.6 KiB |
Quantile statistics
| Minimum | 4.092727034 |
|---|---|
| 5-th percentile | 4.570279662 |
| Q1 | 4.763108642 |
| median | 4.899398761 |
| Q3 | 5.033830033 |
| 95-th percentile | 5.228652022 |
| Maximum | 5.60982897 |
| Range | 1.517101936 |
| Interquartile range (IQR) | 0.2707213911 |
Descriptive statistics
| Standard deviation | 0.1996628593 |
|---|---|
| Coefficient of variation (CV) | 0.04075691238 |
| Kurtosis | -0.06282141886 |
| Mean | 4.898871078 |
| Median Absolute Deviation (MAD) | 0.1352425499 |
| Skewness | 0.01002341388 |
| Sum | 108181.77 |
| Variance | 0.03986525739 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 5.121511338 | 1 | < 0.1% | |
| 4.542396215 | 1 | < 0.1% | |
| 4.977424421 | 1 | < 0.1% | |
| 4.314907656 | 1 | < 0.1% | |
| 4.63380815 | 1 | < 0.1% | |
| 4.873329458 | 1 | < 0.1% | |
| 4.753001504 | 1 | < 0.1% | |
| 4.702750533 | 1 | < 0.1% | |
| 4.793014289 | 1 | < 0.1% | |
| 4.660101808 | 1 | < 0.1% | |
| 4.875446015 | 1 | < 0.1% | |
| 5.011891104 | 1 | < 0.1% | |
| 4.664839534 | 1 | < 0.1% | |
| 4.99880726 | 1 | < 0.1% | |
| 4.963989412 | 1 | < 0.1% | |
| 4.999673677 | 1 | < 0.1% | |
| 5.063542057 | 1 | < 0.1% | |
| 4.934590431 | 1 | < 0.1% | |
| 4.821767017 | 1 | < 0.1% | |
| 4.994143725 | 1 | < 0.1% | |
| 4.902499933 | 1 | < 0.1% | |
| 4.790835626 | 1 | < 0.1% | |
| 4.738422213 | 1 | < 0.1% | |
| 5.196355941 | 1 | < 0.1% | |
| 4.643701909 | 1 | < 0.1% | |
| Other values (22058) | 22058 | 99.9% |
| Value | Count | Frequency (%) | |
| 4.092727034 | 1 | < 0.1% | |
| 4.146229815 | 1 | < 0.1% | |
| 4.185821105 | 1 | < 0.1% | |
| 4.203464164 | 1 | < 0.1% | |
| 4.215599036 | 1 | < 0.1% | |
| 4.23572663 | 1 | < 0.1% | |
| 4.248565352 | 1 | < 0.1% | |
| 4.250212496 | 1 | < 0.1% | |
| 4.258798724 | 1 | < 0.1% | |
| 4.264795714 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 5.60982897 | 1 | < 0.1% | |
| 5.592450707 | 1 | < 0.1% | |
| 5.574096672 | 1 | < 0.1% | |
| 5.571966475 | 1 | < 0.1% | |
| 5.569902074 | 1 | < 0.1% | |
| 5.564212158 | 1 | < 0.1% | |
| 5.558932575 | 1 | < 0.1% | |
| 5.553951564 | 1 | < 0.1% | |
| 5.536403702 | 1 | < 0.1% | |
| 5.532782297 | 1 | < 0.1% |
| Distinct | 2524 |
|---|---|
| Distinct (%) | 11.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 172.6 KiB |
| James | 420 |
|---|---|
| John | 372 |
| Robert | 355 |
| Mary | 329 |
| Michael | 321 |
| Other values (2519) |
| Value | Count | Frequency (%) | |
| James | 420 | 1.9% | |
| John | 372 | 1.7% | |
| Robert | 355 | 1.6% | |
| Mary | 329 | 1.5% | |
| Michael | 321 | 1.5% | |
| David | 288 | 1.3% | |
| William | 287 | 1.3% | |
| Charles | 196 | 0.9% | |
| Richard | 184 | 0.8% | |
| Thomas | 174 | 0.8% | |
| Joseph | 171 | 0.8% | |
| Barbara | 130 | 0.6% | |
| Donald | 122 | 0.6% | |
| Jennifer | 120 | 0.5% | |
| Patricia | 119 | 0.5% | |
| Daniel | 115 | 0.5% | |
| Elizabeth | 111 | 0.5% | |
| Linda | 108 | 0.5% | |
| Mark | 108 | 0.5% | |
| Maria | 107 | 0.5% | |
| George | 106 | 0.5% | |
| Paul | 106 | 0.5% | |
| Margaret | 105 | 0.5% | |
| Christopher | 104 | 0.5% | |
| Dorothy | 100 | 0.5% | |
| Other values (2499) | 17425 | 78.9% |
Frequencies of value counts
Unique
| Unique | 1021 ? |
|---|---|
| Unique (%) | 4.6% |
Histogram of lengths of the category
Length
| Max length | 11 |
|---|---|
| Median length | 6 |
| Mean length | 5.794049722 |
| Min length | 2 |
Most occurring characters
| Value | Count | Frequency (%) | |
| a | 15248 | 11.9% | |
| e | 14048 | 11.0% | |
| r | 10014 | 7.8% | |
| n | 9819 | 7.7% | |
| i | 9228 | 7.2% | |
| l | 7403 | 5.8% | |
| o | 6262 | 4.9% | |
| t | 4850 | 3.8% | |
| h | 4723 | 3.7% | |
| s | 4097 | 3.2% | |
| y | 4002 | 3.1% | |
| d | 3281 | 2.6% | |
| J | 2962 | 2.3% | |
| c | 2702 | 2.1% | |
| m | 2316 | 1.8% | |
| M | 2197 | 1.7% | |
| u | 1857 | 1.5% | |
| R | 1751 | 1.4% | |
| D | 1645 | 1.3% | |
| C | 1574 | 1.2% | |
| A | 1406 | 1.1% | |
| S | 1379 | 1.1% | |
| b | 1354 | 1.1% | |
| L | 1235 | 1.0% | |
| B | 1061 | 0.8% | |
| Other values (27) | 11536 | 9.0% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 105867 | 82.7% | |
| Uppercase Letter | 22083 | 17.3% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| J | 2962 | 13.4% | |
| M | 2197 | 9.9% | |
| R | 1751 | 7.9% | |
| D | 1645 | 7.4% | |
| C | 1574 | 7.1% | |
| A | 1406 | 6.4% | |
| S | 1379 | 6.2% | |
| L | 1235 | 5.6% | |
| B | 1061 | 4.8% | |
| E | 995 | 4.5% | |
| K | 932 | 4.2% | |
| T | 928 | 4.2% | |
| G | 688 | 3.1% | |
| P | 684 | 3.1% | |
| W | 623 | 2.8% | |
| H | 534 | 2.4% | |
| N | 397 | 1.8% | |
| F | 380 | 1.7% | |
| V | 340 | 1.5% | |
| I | 164 | 0.7% | |
| O | 108 | 0.5% | |
| Y | 57 | 0.3% | |
| Z | 22 | 0.1% | |
| Q | 10 | < 0.1% | |
| U | 9 | < 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| a | 15248 | 14.4% | |
| e | 14048 | 13.3% | |
| r | 10014 | 9.5% | |
| n | 9819 | 9.3% | |
| i | 9228 | 8.7% | |
| l | 7403 | 7.0% | |
| o | 6262 | 5.9% | |
| t | 4850 | 4.6% | |
| h | 4723 | 4.5% | |
| s | 4097 | 3.9% | |
| y | 4002 | 3.8% | |
| d | 3281 | 3.1% | |
| c | 2702 | 2.6% | |
| m | 2316 | 2.2% | |
| u | 1857 | 1.8% | |
| b | 1354 | 1.3% | |
| v | 949 | 0.9% | |
| g | 932 | 0.9% | |
| f | 627 | 0.6% | |
| p | 606 | 0.6% | |
| k | 580 | 0.5% | |
| w | 524 | 0.5% | |
| z | 215 | 0.2% | |
| x | 103 | 0.1% | |
| j | 73 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 127950 | 100.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| a | 15248 | 11.9% | |
| e | 14048 | 11.0% | |
| r | 10014 | 7.8% | |
| n | 9819 | 7.7% | |
| i | 9228 | 7.2% | |
| l | 7403 | 5.8% | |
| o | 6262 | 4.9% | |
| t | 4850 | 3.8% | |
| h | 4723 | 3.7% | |
| s | 4097 | 3.2% | |
| y | 4002 | 3.1% | |
| d | 3281 | 2.6% | |
| J | 2962 | 2.3% | |
| c | 2702 | 2.1% | |
| m | 2316 | 1.8% | |
| M | 2197 | 1.7% | |
| u | 1857 | 1.5% | |
| R | 1751 | 1.4% | |
| D | 1645 | 1.3% | |
| C | 1574 | 1.2% | |
| A | 1406 | 1.1% | |
| S | 1379 | 1.1% | |
| b | 1354 | 1.1% | |
| L | 1235 | 1.0% | |
| B | 1061 | 0.8% | |
| Other values (27) | 11536 | 9.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 127950 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| a | 15248 | 11.9% | |
| e | 14048 | 11.0% | |
| r | 10014 | 7.8% | |
| n | 9819 | 7.7% | |
| i | 9228 | 7.2% | |
| l | 7403 | 5.8% | |
| o | 6262 | 4.9% | |
| t | 4850 | 3.8% | |
| h | 4723 | 3.7% | |
| s | 4097 | 3.2% | |
| y | 4002 | 3.1% | |
| d | 3281 | 2.6% | |
| J | 2962 | 2.3% | |
| c | 2702 | 2.1% | |
| m | 2316 | 1.8% | |
| M | 2197 | 1.7% | |
| u | 1857 | 1.5% | |
| R | 1751 | 1.4% | |
| D | 1645 | 1.3% | |
| C | 1574 | 1.2% | |
| A | 1406 | 1.1% | |
| S | 1379 | 1.1% | |
| b | 1354 | 1.1% | |
| L | 1235 | 1.0% | |
| B | 1061 | 0.8% | |
| Other values (27) | 11536 | 9.0% |
| Distinct | 6282 |
|---|---|
| Distinct (%) | 50.7% |
| Missing | 9691 |
| Missing (%) | 43.9% |
| Memory size | 172.6 KiB |
| Smith | 157 |
|---|---|
| Williams | 106 |
| Johnson | 99 |
| Brown | 90 |
| Jones | 81 |
| Other values (6277) |
| Value | Count | Frequency (%) | |
| Smith | 157 | 0.7% | |
| Williams | 106 | 0.5% | |
| Johnson | 99 | 0.4% | |
| Brown | 90 | 0.4% | |
| Jones | 81 | 0.4% | |
| Davis | 61 | 0.3% | |
| Miller | 54 | 0.2% | |
| Wilson | 50 | 0.2% | |
| White | 49 | 0.2% | |
| Harris | 48 | 0.2% | |
| Jackson | 47 | 0.2% | |
| Anderson | 38 | 0.2% | |
| Taylor | 38 | 0.2% | |
| Martin | 36 | 0.2% | |
| Moore | 36 | 0.2% | |
| Garcia | 35 | 0.2% | |
| Thomas | 34 | 0.2% | |
| Walker | 34 | 0.2% | |
| Lee | 33 | 0.1% | |
| Clark | 31 | 0.1% | |
| Young | 31 | 0.1% | |
| Hall | 30 | 0.1% | |
| Martinez | 29 | 0.1% | |
| Mitchell | 28 | 0.1% | |
| Roberts | 27 | 0.1% | |
| Other values (6257) | 11090 | 50.2% | |
| (Missing) | 9691 | 43.9% |
Frequencies of value counts
Unique
| Unique | 4556 ? |
|---|---|
| Unique (%) | 36.8% |
Histogram of lengths of the category
Length
| Max length | 13 |
|---|---|
| Median length | 4 |
| Mean length | 4.840239098 |
| Min length | 2 |
Most occurring characters
| Value | Count | Frequency (%) | |
| n | 25372 | 23.7% | |
| a | 16153 | 15.1% | |
| e | 8005 | 7.5% | |
| r | 6240 | 5.8% | |
| o | 5519 | 5.2% | |
| l | 4720 | 4.4% | |
| i | 4486 | 4.2% | |
| s | 4098 | 3.8% | |
| t | 3159 | 3.0% | |
| h | 2011 | 1.9% | |
| d | 1836 | 1.7% | |
| u | 1737 | 1.6% | |
| c | 1728 | 1.6% | |
| m | 1678 | 1.6% | |
| y | 1370 | 1.3% | |
| M | 1246 | 1.2% | |
| g | 1226 | 1.1% | |
| S | 1136 | 1.1% | |
| B | 1124 | 1.1% | |
| k | 1088 | 1.0% | |
| H | 956 | 0.9% | |
| C | 948 | 0.9% | |
| b | 799 | 0.7% | |
| w | 770 | 0.7% | |
| W | 767 | 0.7% | |
| Other values (27) | 8715 | 8.2% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 94495 | 88.4% | |
| Uppercase Letter | 12392 | 11.6% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 25372 | 26.9% | |
| a | 16153 | 17.1% | |
| e | 8005 | 8.5% | |
| r | 6240 | 6.6% | |
| o | 5519 | 5.8% | |
| l | 4720 | 5.0% | |
| i | 4486 | 4.7% | |
| s | 4098 | 4.3% | |
| t | 3159 | 3.3% | |
| h | 2011 | 2.1% | |
| d | 1836 | 1.9% | |
| u | 1737 | 1.8% | |
| c | 1728 | 1.8% | |
| m | 1678 | 1.8% | |
| y | 1370 | 1.4% | |
| g | 1226 | 1.3% | |
| k | 1088 | 1.2% | |
| b | 799 | 0.8% | |
| w | 770 | 0.8% | |
| z | 695 | 0.7% | |
| p | 657 | 0.7% | |
| v | 521 | 0.6% | |
| f | 400 | 0.4% | |
| x | 115 | 0.1% | |
| j | 56 | 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| M | 1246 | 10.1% | |
| S | 1136 | 9.2% | |
| B | 1124 | 9.1% | |
| H | 956 | 7.7% | |
| C | 948 | 7.7% | |
| W | 767 | 6.2% | |
| R | 739 | 6.0% | |
| G | 646 | 5.2% | |
| P | 634 | 5.1% | |
| L | 559 | 4.5% | |
| D | 558 | 4.5% | |
| A | 438 | 3.5% | |
| F | 425 | 3.4% | |
| T | 418 | 3.4% | |
| J | 412 | 3.3% | |
| K | 370 | 3.0% | |
| N | 242 | 2.0% | |
| E | 218 | 1.8% | |
| O | 184 | 1.5% | |
| V | 156 | 1.3% | |
| Y | 63 | 0.5% | |
| Z | 58 | 0.5% | |
| I | 48 | 0.4% | |
| U | 29 | 0.2% | |
| Q | 16 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 106887 | 100.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| n | 25372 | 23.7% | |
| a | 16153 | 15.1% | |
| e | 8005 | 7.5% | |
| r | 6240 | 5.8% | |
| o | 5519 | 5.2% | |
| l | 4720 | 4.4% | |
| i | 4486 | 4.2% | |
| s | 4098 | 3.8% | |
| t | 3159 | 3.0% | |
| h | 2011 | 1.9% | |
| d | 1836 | 1.7% | |
| u | 1737 | 1.6% | |
| c | 1728 | 1.6% | |
| m | 1678 | 1.6% | |
| y | 1370 | 1.3% | |
| M | 1246 | 1.2% | |
| g | 1226 | 1.1% | |
| S | 1136 | 1.1% | |
| B | 1124 | 1.1% | |
| k | 1088 | 1.0% | |
| H | 956 | 0.9% | |
| C | 948 | 0.9% | |
| b | 799 | 0.7% | |
| w | 770 | 0.7% | |
| W | 767 | 0.7% | |
| Other values (27) | 8715 | 8.2% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 106887 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| n | 25372 | 23.7% | |
| a | 16153 | 15.1% | |
| e | 8005 | 7.5% | |
| r | 6240 | 5.8% | |
| o | 5519 | 5.2% | |
| l | 4720 | 4.4% | |
| i | 4486 | 4.2% | |
| s | 4098 | 3.8% | |
| t | 3159 | 3.0% | |
| h | 2011 | 1.9% | |
| d | 1836 | 1.7% | |
| u | 1737 | 1.6% | |
| c | 1728 | 1.6% | |
| m | 1678 | 1.6% | |
| y | 1370 | 1.3% | |
| M | 1246 | 1.2% | |
| g | 1226 | 1.1% | |
| S | 1136 | 1.1% | |
| B | 1124 | 1.1% | |
| k | 1088 | 1.0% | |
| H | 956 | 0.9% | |
| C | 948 | 0.9% | |
| b | 799 | 0.7% | |
| w | 770 | 0.7% | |
| W | 767 | 0.7% | |
| Other values (27) | 8715 | 8.2% |
| Distinct | 16368 |
|---|---|
| Distinct (%) | 74.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 172.6 KiB |
| Clardie | 6 |
|---|---|
| Sager | 6 |
| Muhib | 5 |
| Hafiz | 5 |
| Nashon | 5 |
| Other values (16363) |
| Value | Count | Frequency (%) | |
| Clardie | 6 | < 0.1% | |
| Sager | 6 | < 0.1% | |
| Muhib | 5 | < 0.1% | |
| Hafiz | 5 | < 0.1% | |
| Nashon | 5 | < 0.1% | |
| Wilder | 5 | < 0.1% | |
| Lucciano | 5 | < 0.1% | |
| Marell | 5 | < 0.1% | |
| Edwen | 5 | < 0.1% | |
| Daiquon | 5 | < 0.1% | |
| True | 5 | < 0.1% | |
| Buzzy | 5 | < 0.1% | |
| Daric | 4 | < 0.1% | |
| Williaa | 4 | < 0.1% | |
| Royston | 4 | < 0.1% | |
| Maurisio | 4 | < 0.1% | |
| Ajai | 4 | < 0.1% | |
| Dhahran | 4 | < 0.1% | |
| Zantavious | 4 | < 0.1% | |
| Linwood | 4 | < 0.1% | |
| Kieffer | 4 | < 0.1% | |
| Trezden | 4 | < 0.1% | |
| Olufemi | 4 | < 0.1% | |
| Reiter | 4 | < 0.1% | |
| Jamesanthony | 4 | < 0.1% | |
| Other values (16343) | 21969 | 99.5% |
Frequencies of value counts
Unique
| Unique | 11741 ? |
|---|---|
| Unique (%) | 53.2% |
Histogram of lengths of the category
Length
| Max length | 15 |
|---|---|
| Median length | 6 |
| Mean length | 6.289181723 |
| Min length | 2 |
Most occurring characters
| Value | Count | Frequency (%) | |
| a | 16480 | 11.9% | |
| e | 13599 | 9.8% | |
| n | 11672 | 8.4% | |
| i | 9951 | 7.2% | |
| r | 9939 | 7.2% | |
| o | 8266 | 6.0% | |
| l | 6625 | 4.8% | |
| s | 5016 | 3.6% | |
| h | 4534 | 3.3% | |
| y | 3960 | 2.9% | |
| d | 3846 | 2.8% | |
| t | 3842 | 2.8% | |
| u | 3673 | 2.6% | |
| m | 3371 | 2.4% | |
| J | 2309 | 1.7% | |
| c | 2065 | 1.5% | |
| A | 1946 | 1.4% | |
| D | 1894 | 1.4% | |
| v | 1868 | 1.3% | |
| k | 1841 | 1.3% | |
| K | 1600 | 1.2% | |
| T | 1473 | 1.1% | |
| M | 1368 | 1.0% | |
| S | 1343 | 1.0% | |
| C | 1247 | 0.9% | |
| Other values (27) | 15156 | 10.9% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 116801 | 84.1% | |
| Uppercase Letter | 22083 | 15.9% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| J | 2309 | 10.5% | |
| A | 1946 | 8.8% | |
| D | 1894 | 8.6% | |
| K | 1600 | 7.2% | |
| T | 1473 | 6.7% | |
| M | 1368 | 6.2% | |
| S | 1343 | 6.1% | |
| C | 1247 | 5.6% | |
| R | 1203 | 5.4% | |
| B | 948 | 4.3% | |
| L | 920 | 4.2% | |
| E | 834 | 3.8% | |
| N | 651 | 2.9% | |
| H | 606 | 2.7% | |
| G | 594 | 2.7% | |
| Z | 496 | 2.2% | |
| O | 391 | 1.8% | |
| W | 361 | 1.6% | |
| P | 329 | 1.5% | |
| Y | 327 | 1.5% | |
| F | 313 | 1.4% | |
| V | 296 | 1.3% | |
| I | 288 | 1.3% | |
| Q | 194 | 0.9% | |
| U | 85 | 0.4% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| a | 16480 | 14.1% | |
| e | 13599 | 11.6% | |
| n | 11672 | 10.0% | |
| i | 9951 | 8.5% | |
| r | 9939 | 8.5% | |
| o | 8266 | 7.1% | |
| l | 6625 | 5.7% | |
| s | 5016 | 4.3% | |
| h | 4534 | 3.9% | |
| y | 3960 | 3.4% | |
| d | 3846 | 3.3% | |
| t | 3842 | 3.3% | |
| u | 3673 | 3.1% | |
| m | 3371 | 2.9% | |
| c | 2065 | 1.8% | |
| v | 1868 | 1.6% | |
| k | 1841 | 1.6% | |
| b | 1184 | 1.0% | |
| g | 903 | 0.8% | |
| z | 898 | 0.8% | |
| w | 768 | 0.7% | |
| j | 650 | 0.6% | |
| f | 623 | 0.5% | |
| p | 509 | 0.4% | |
| q | 420 | 0.4% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 138884 | 100.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| a | 16480 | 11.9% | |
| e | 13599 | 9.8% | |
| n | 11672 | 8.4% | |
| i | 9951 | 7.2% | |
| r | 9939 | 7.2% | |
| o | 8266 | 6.0% | |
| l | 6625 | 4.8% | |
| s | 5016 | 3.6% | |
| h | 4534 | 3.3% | |
| y | 3960 | 2.9% | |
| d | 3846 | 2.8% | |
| t | 3842 | 2.8% | |
| u | 3673 | 2.6% | |
| m | 3371 | 2.4% | |
| J | 2309 | 1.7% | |
| c | 2065 | 1.5% | |
| A | 1946 | 1.4% | |
| D | 1894 | 1.4% | |
| v | 1868 | 1.3% | |
| k | 1841 | 1.3% | |
| K | 1600 | 1.2% | |
| T | 1473 | 1.1% | |
| M | 1368 | 1.0% | |
| S | 1343 | 1.0% | |
| C | 1247 | 0.9% | |
| Other values (27) | 15156 | 10.9% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 138884 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| a | 16480 | 11.9% | |
| e | 13599 | 9.8% | |
| n | 11672 | 8.4% | |
| i | 9951 | 7.2% | |
| r | 9939 | 7.2% | |
| o | 8266 | 6.0% | |
| l | 6625 | 4.8% | |
| s | 5016 | 3.6% | |
| h | 4534 | 3.3% | |
| y | 3960 | 2.9% | |
| d | 3846 | 2.8% | |
| t | 3842 | 2.8% | |
| u | 3673 | 2.6% | |
| m | 3371 | 2.4% | |
| J | 2309 | 1.7% | |
| c | 2065 | 1.5% | |
| A | 1946 | 1.4% | |
| D | 1894 | 1.4% | |
| v | 1868 | 1.3% | |
| k | 1841 | 1.3% | |
| K | 1600 | 1.2% | |
| T | 1473 | 1.1% | |
| M | 1368 | 1.0% | |
| S | 1343 | 1.0% | |
| C | 1247 | 0.9% | |
| Other values (27) | 15156 | 10.9% |
| Distinct | 34 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 6036 |
| Missing (%) | 27.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 34.52645354 |
|---|---|
| Minimum | 18 |
| Maximum | 51 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 172.6 KiB |
Quantile statistics
| Minimum | 18 |
|---|---|
| 5-th percentile | 19 |
| Q1 | 26 |
| median | 35 |
| Q3 | 43 |
| 95-th percentile | 50 |
| Maximum | 51 |
| Range | 33 |
| Interquartile range (IQR) | 17 |
Descriptive statistics
| Standard deviation | 9.852598421 |
|---|---|
| Coefficient of variation (CV) | 0.2853637548 |
| Kurtosis | -1.223088083 |
| Mean | 34.52645354 |
| Median Absolute Deviation (MAD) | 9 |
| Skewness | -0.005153871873 |
| Sum | 554046 |
| Variance | 97.07369565 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=34)
| Value | Count | Frequency (%) | |
| 23 | 525 | 2.4% | |
| 19 | 516 | 2.3% | |
| 40 | 515 | 2.3% | |
| 28 | 508 | 2.3% | |
| 47 | 508 | 2.3% | |
| 48 | 507 | 2.3% | |
| 41 | 502 | 2.3% | |
| 45 | 490 | 2.2% | |
| 44 | 489 | 2.2% | |
| 21 | 488 | 2.2% | |
| 35 | 484 | 2.2% | |
| 24 | 480 | 2.2% | |
| 49 | 479 | 2.2% | |
| 50 | 476 | 2.2% | |
| 30 | 473 | 2.1% | |
| 27 | 471 | 2.1% | |
| 29 | 469 | 2.1% | |
| 32 | 465 | 2.1% | |
| 38 | 465 | 2.1% | |
| 42 | 464 | 2.1% | |
| 37 | 463 | 2.1% | |
| 22 | 463 | 2.1% | |
| 46 | 460 | 2.1% | |
| 26 | 457 | 2.1% | |
| 31 | 457 | 2.1% | |
| Other values (9) | 3973 | 18.0% | |
| (Missing) | 6036 | 27.3% |
| Value | Count | Frequency (%) | |
| 18 | 443 | 2.0% | |
| 19 | 516 | 2.3% | |
| 20 | 451 | 2.0% | |
| 21 | 488 | 2.2% | |
| 22 | 463 | 2.1% | |
| 23 | 525 | 2.4% | |
| 24 | 480 | 2.2% | |
| 25 | 435 | 2.0% | |
| 26 | 457 | 2.1% | |
| 27 | 471 | 2.1% |
| Value | Count | Frequency (%) | |
| 51 | 449 | 2.0% | |
| 50 | 476 | 2.2% | |
| 49 | 479 | 2.2% | |
| 48 | 507 | 2.3% | |
| 47 | 508 | 2.3% | |
| 46 | 460 | 2.1% | |
| 45 | 490 | 2.2% | |
| 44 | 489 | 2.2% | |
| 43 | 437 | 2.0% | |
| 42 | 464 | 2.1% |
| Distinct | 45 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 5986 |
| Missing (%) | 27.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 41.97285208 |
|---|---|
| Minimum | 20 |
| Maximum | 64 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 172.6 KiB |
Quantile statistics
| Minimum | 20 |
|---|---|
| 5-th percentile | 22 |
| Q1 | 31 |
| median | 42 |
| Q3 | 53 |
| 95-th percentile | 62 |
| Maximum | 64 |
| Range | 44 |
| Interquartile range (IQR) | 22 |
Descriptive statistics
| Standard deviation | 13.03550058 |
|---|---|
| Coefficient of variation (CV) | 0.3105698072 |
| Kurtosis | -1.213855517 |
| Mean | 41.97285208 |
| Median Absolute Deviation (MAD) | 11 |
| Skewness | -0.005839712015 |
| Sum | 675637 |
| Variance | 169.9242754 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=45)
| Value | Count | Frequency (%) | |
| 20 | 414 | 1.9% | |
| 49 | 400 | 1.8% | |
| 29 | 399 | 1.8% | |
| 61 | 396 | 1.8% | |
| 57 | 381 | 1.7% | |
| 39 | 380 | 1.7% | |
| 56 | 378 | 1.7% | |
| 53 | 377 | 1.7% | |
| 27 | 377 | 1.7% | |
| 30 | 377 | 1.7% | |
| 44 | 372 | 1.7% | |
| 26 | 371 | 1.7% | |
| 38 | 370 | 1.7% | |
| 52 | 369 | 1.7% | |
| 64 | 368 | 1.7% | |
| 32 | 365 | 1.7% | |
| 40 | 363 | 1.6% | |
| 37 | 363 | 1.6% | |
| 51 | 363 | 1.6% | |
| 59 | 361 | 1.6% | |
| 23 | 360 | 1.6% | |
| 28 | 358 | 1.6% | |
| 58 | 358 | 1.6% | |
| 50 | 357 | 1.6% | |
| 31 | 356 | 1.6% | |
| Other values (20) | 6764 | 30.6% | |
| (Missing) | 5986 | 27.1% |
| Value | Count | Frequency (%) | |
| 20 | 414 | 1.9% | |
| 21 | 350 | 1.6% | |
| 22 | 339 | 1.5% | |
| 23 | 360 | 1.6% | |
| 24 | 349 | 1.6% | |
| 25 | 323 | 1.5% | |
| 26 | 371 | 1.7% | |
| 27 | 377 | 1.7% | |
| 28 | 358 | 1.6% | |
| 29 | 399 | 1.8% |
| Value | Count | Frequency (%) | |
| 64 | 368 | 1.7% | |
| 63 | 314 | 1.4% | |
| 62 | 347 | 1.6% | |
| 61 | 396 | 1.8% | |
| 60 | 345 | 1.6% | |
| 59 | 361 | 1.6% | |
| 58 | 358 | 1.6% | |
| 57 | 381 | 1.7% | |
| 56 | 378 | 1.7% | |
| 55 | 347 | 1.6% |
| Distinct | 27 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 5106 |
| Missing (%) | 23.1% |
| Memory size | 172.6 KiB |
| Not applicable | |
|---|---|
| Franciscan Children's Hospital | 363 |
| Carney Hospital | 357 |
| New England Medical Center | 350 |
| Hebrew Rehabilitation Center | 349 |
| Other values (22) |
| Value | Count | Frequency (%) | |
| Not applicable | 8440 | 38.2% | |
| Franciscan Children's Hospital | 363 | 1.6% | |
| Carney Hospital | 357 | 1.6% | |
| New England Medical Center | 350 | 1.6% | |
| Hebrew Rehabilitation Center | 349 | 1.6% | |
| VA Hospital | 344 | 1.6% | |
| Shriners Burns Institute | 341 | 1.5% | |
| Massachusetts Eye & Ear Infirmary | 337 | 1.5% | |
| Brigham And Women's Hospital | 334 | 1.5% | |
| Boston City Hospital | 330 | 1.5% | |
| St. Margaret's Hospital For Women | 329 | 1.5% | |
| Arbour Hospital | 327 | 1.5% | |
| Spaulding Rehabilitation Hospital | 325 | 1.5% | |
| Faulkner Hospital | 325 | 1.5% | |
| Children's Hospital | 324 | 1.5% | |
| Kindred Hospital | 324 | 1.5% | |
| Dana-farber Cancer Institute | 323 | 1.5% | |
| Boston Specialty & Rehabilitation Hospital | 322 | 1.5% | |
| Massachusetts General Hospital | 321 | 1.5% | |
| Beth Israel Deaconess Medical Center East Cam | 320 | 1.4% | |
| Boston Medical Center | 318 | 1.4% | |
| New England Baptist Hospital | 317 | 1.4% | |
| Jewish Memorial Hospital | 315 | 1.4% | |
| Beth Israel Deaconess Medical Center West Cam | 315 | 1.4% | |
| Lemuel Shattuck Hospital | 313 | 1.4% | |
| Other values (2) | 614 | 2.8% | |
| (Missing) | 5106 | 23.1% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 45 |
|---|---|
| Median length | 14 |
| Mean length | 16.00461894 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| a | 42266 | 12.0% | |
| l | 29275 | 8.3% | |
| 29081 | 8.2% | ||
| t | 26690 | 7.6% | |
| e | 26116 | 7.4% | |
| i | 23891 | 6.8% | |
| p | 23728 | 6.7% | |
| n | 22190 | 6.3% | |
| o | 19529 | 5.5% | |
| s | 16019 | 4.5% | |
| c | 12720 | 3.6% | |
| b | 10737 | 3.0% | |
| r | 10306 | 2.9% | |
| N | 9107 | 2.6% | |
| H | 6233 | 1.8% | |
| h | 4581 | 1.3% | |
| C | 3984 | 1.1% | |
| d | 3964 | 1.1% | |
| u | 3266 | 0.9% | |
| M | 2605 | 0.7% | |
| B | 2597 | 0.7% | |
| m | 2597 | 0.7% | |
| E | 1963 | 0.6% | |
| S | 1932 | 0.5% | |
| y | 1683 | 0.5% | |
| Other values (20) | 16370 | 4.6% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 284144 | 80.4% | |
| Uppercase Letter | 36940 | 10.5% | |
| Space Separator | 29081 | 8.2% | |
| Other Punctuation | 2942 | 0.8% | |
| Dash Punctuation | 323 | 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| N | 9107 | 24.7% | |
| H | 6233 | 16.9% | |
| C | 3984 | 10.8% | |
| M | 2605 | 7.1% | |
| B | 2597 | 7.0% | |
| E | 1963 | 5.3% | |
| S | 1932 | 5.2% | |
| I | 1636 | 4.4% | |
| F | 1017 | 2.8% | |
| A | 1005 | 2.7% | |
| R | 996 | 2.7% | |
| W | 978 | 2.6% | |
| D | 958 | 2.6% | |
| V | 656 | 1.8% | |
| K | 324 | 0.9% | |
| G | 321 | 0.9% | |
| J | 315 | 0.9% | |
| L | 313 | 0.8% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| a | 42266 | 14.9% | |
| l | 29275 | 10.3% | |
| t | 26690 | 9.4% | |
| e | 26116 | 9.2% | |
| i | 23891 | 8.4% | |
| p | 23728 | 8.4% | |
| n | 22190 | 7.8% | |
| o | 19529 | 6.9% | |
| s | 16019 | 5.6% | |
| c | 12720 | 4.5% | |
| b | 10737 | 3.8% | |
| r | 10306 | 3.6% | |
| h | 4581 | 1.6% | |
| d | 3964 | 1.4% | |
| u | 3266 | 1.1% | |
| m | 2597 | 0.9% | |
| y | 1683 | 0.6% | |
| g | 1655 | 0.6% | |
| w | 1331 | 0.5% | |
| f | 660 | 0.2% | |
| k | 638 | 0.2% | |
| z | 302 | 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 29081 | 100.0% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| ' | 1652 | 56.2% | |
| & | 659 | 22.4% | |
| . | 631 | 21.4% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 323 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 321084 | 90.8% | |
| Common | 32346 | 9.2% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| a | 42266 | 13.2% | |
| l | 29275 | 9.1% | |
| t | 26690 | 8.3% | |
| e | 26116 | 8.1% | |
| i | 23891 | 7.4% | |
| p | 23728 | 7.4% | |
| n | 22190 | 6.9% | |
| o | 19529 | 6.1% | |
| s | 16019 | 5.0% | |
| c | 12720 | 4.0% | |
| b | 10737 | 3.3% | |
| r | 10306 | 3.2% | |
| N | 9107 | 2.8% | |
| H | 6233 | 1.9% | |
| h | 4581 | 1.4% | |
| C | 3984 | 1.2% | |
| d | 3964 | 1.2% | |
| u | 3266 | 1.0% | |
| M | 2605 | 0.8% | |
| B | 2597 | 0.8% | |
| m | 2597 | 0.8% | |
| E | 1963 | 0.6% | |
| S | 1932 | 0.6% | |
| y | 1683 | 0.5% | |
| g | 1655 | 0.5% | |
| Other values (15) | 11450 | 3.6% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 29081 | 89.9% | ||
| ' | 1652 | 5.1% | |
| & | 659 | 2.0% | |
| . | 631 | 2.0% | |
| - | 323 | 1.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 353430 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| a | 42266 | 12.0% | |
| l | 29275 | 8.3% | |
| 29081 | 8.2% | ||
| t | 26690 | 7.6% | |
| e | 26116 | 7.4% | |
| i | 23891 | 6.8% | |
| p | 23728 | 6.7% | |
| n | 22190 | 6.3% | |
| o | 19529 | 5.5% | |
| s | 16019 | 4.5% | |
| c | 12720 | 3.6% | |
| b | 10737 | 3.0% | |
| r | 10306 | 2.9% | |
| N | 9107 | 2.6% | |
| H | 6233 | 1.8% | |
| h | 4581 | 1.3% | |
| C | 3984 | 1.1% | |
| d | 3964 | 1.1% | |
| u | 3266 | 0.9% | |
| M | 2605 | 0.7% | |
| B | 2597 | 0.7% | |
| m | 2597 | 0.7% | |
| E | 1963 | 0.6% | |
| S | 1932 | 0.5% | |
| y | 1683 | 0.5% | |
| Other values (20) | 16370 | 4.6% |
| Distinct | 26 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 172.6 KiB |
| - | |
|---|---|
| 125 PARKER HILL AV JAMAICA PLAIN, MA 02120 (42.329611374844326, -71.10616871232227) | 864 |
| 249 RIVER ST MATTAPAN, MA 02126 (42.27137912172521, -71.08168028446168) | 466 |
| 2100 DORCHESTER AV DORCHESTER, MA 02124 (42.27854306401838, -71.06631280050811) | 458 |
| 1200 Centre St Roslindale, MA 02131 (42.29738386053219, -71.13150465441208) | 455 |
| Other values (21) |
| Value | Count | Frequency (%) | |
| - | 10931 | 49.5% | |
| 125 PARKER HILL AV JAMAICA PLAIN, MA 02120 (42.329611374844326, -71.10616871232227) | 864 | 3.9% | |
| 249 RIVER ST MATTAPAN, MA 02126 (42.27137912172521, -71.08168028446168) | 466 | 2.1% | |
| 2100 DORCHESTER AV DORCHESTER, MA 02124 (42.27854306401838, -71.06631280050811) | 458 | 2.1% | |
| 1200 Centre St Roslindale, MA 02131 (42.29738386053219, -71.13150465441208) | 455 | 2.1% | |
| 51 BLOSSOM ST CENTRAL, MA 02114 (42.36327718561898, -71.0668523937257) | 451 | 2.0% | |
| 736 CAMBRIDGE ST ALLSTON/BRIGHTON, MA 02135 (42.349656455743144, -71.14822103232248) | 446 | 2.0% | |
| 75 FRANCIS ST FENWAY/KENMORE, MA 02115 (42.33587602903896, -71.10741054246668) | 443 | 2.0% | |
| 818 HARRISON AV SOUTH END, MA 02118 (42.335925371008436, -71.07378404269969) | 443 | 2.0% | |
| 59 TOWNSEND ST ROXBURY, MA 02119 (42.31856289432221, -71.09165569529381) | 442 | 2.0% | |
| 1400 VFW Parkway West Roxbury, MA 02132 (42.27598935537618, -71.17245195460838) | 441 | 2.0% | |
| 90 CUSHING AV DORCHESTER, MA 02125 (42.314030311294516, -71.06406449543488) | 440 | 2.0% | |
| 300 LONGWOOD AV FENWAY/KENMORE, MA 02115 (42.337592548462226, -71.10472284437952) | 436 | 2.0% | |
| 30 WARREN ST ALLSTON/BRIGHTON, MA 02134 (42.352620000312925, -71.13281000028115) | 434 | 2.0% | |
| 185 PILGRIM RD FENWAY/KENMORE, MA 02215 (42.3385289546495, -71.10940050507557) | 433 | 2.0% | |
| 44 BINNEY ST FENWAY/KENMORE, MA 02115 (42.33734993862189, -71.1071702648531) | 423 | 1.9% | |
| 125 NASHUA ST CENTRAL, MA 02114 (42.36764789068138, -71.06564730220646) | 421 | 1.9% | |
| 750 WASHINGTON ST CENTRAL, MA 02111 (42.349946522039204, -71.0634111017112) | 420 | 1.9% | |
| 330 BROOKLINE AV FENWAY/KENMORE, MA 02115 (42.3438499996779, -71.08983000035408) | 417 | 1.9% | |
| 1515 COMMONWEALTH AV ALLSTON/BRIGHTON, MA 02135 (42.34665771451756, -71.14136122385321) | 417 | 1.9% | |
| 55 FRUIT ST CENTRAL, MA 02114 (42.36247485742686, -71.06924724545246) | 412 | 1.9% | |
| 243 CHARLES ST CENTRAL, MA 02114 (42.36297141612903, -71.07043169540236) | 412 | 1.9% | |
| 88 EAST NEWTON ST SOUTH END, MA 02118 (42.3371094801158, -71.07139912234962) | 403 | 1.8% | |
| 49 ROBINWOOD AV JAMAICA PLAIN, MA 02130 (42.31617666213941, -71.11272670363542) | 402 | 1.8% | |
| 170 MORTON ST ROSLINDALE, MA 02130 (42.30025000839615, -71.10737910445549) | 388 | 1.8% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 87 |
|---|---|
| Median length | 69 |
| Mean length | 39.35891863 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 1 | 71631 | 8.2% | |
| 2 | 66683 | 7.7% | |
| 59965 | 6.9% | ||
| 0 | 52575 | 6.0% | |
| 4 | 48960 | 5.6% | |
| 3 | 44021 | 5.1% | |
| 7 | 35984 | 4.1% | |
| 5 | 34800 | 4.0% | |
| 6 | 34341 | 4.0% | |
| A | 34108 | 3.9% | |
| 8 | 30265 | 3.5% | |
| 9 | 26062 | 3.0% | |
| 22304 | 2.6% | ||
| , | 22304 | 2.6% | |
| . | 22304 | 2.6% | |
| - | 22083 | 2.5% | |
| N | 20367 | 2.3% | |
| R | 19030 | 2.2% | |
| E | 18415 | 2.1% | |
| M | 17973 | 2.1% | |
| T | 17060 | 2.0% | |
| O | 15346 | 1.8% | |
| S | 14614 | 1.7% | |
| L | 11431 | 1.3% | |
| ( | 11152 | 1.3% | |
| Other values (33) | 95385 | 11.0% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 445322 | 51.2% | |
| Uppercase Letter | 235688 | 27.1% | |
| Space Separator | 59965 | 6.9% | |
| Other Punctuation | 48057 | 5.5% | |
| Control | 22304 | 2.6% | |
| Dash Punctuation | 22083 | 2.5% | |
| Lowercase Letter | 13440 | 1.5% | |
| Open Punctuation | 11152 | 1.3% | |
| Close Punctuation | 11152 | 1.3% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 1 | 71631 | 16.1% | |
| 2 | 66683 | 15.0% | |
| 0 | 52575 | 11.8% | |
| 4 | 48960 | 11.0% | |
| 3 | 44021 | 9.9% | |
| 7 | 35984 | 8.1% | |
| 5 | 34800 | 7.8% | |
| 6 | 34341 | 7.7% | |
| 8 | 30265 | 6.8% | |
| 9 | 26062 | 5.9% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 59965 | 100.0% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| A | 34108 | 14.5% | |
| N | 20367 | 8.6% | |
| R | 19030 | 8.1% | |
| E | 18415 | 7.8% | |
| M | 17973 | 7.6% | |
| T | 17060 | 7.2% | |
| O | 15346 | 6.5% | |
| S | 14614 | 6.2% | |
| L | 11431 | 4.9% | |
| I | 11029 | 4.7% | |
| C | 8121 | 3.4% | |
| H | 6916 | 2.9% | |
| W | 5988 | 2.5% | |
| V | 4784 | 2.0% | |
| D | 4749 | 2.0% | |
| B | 3878 | 1.6% | |
| P | 3855 | 1.6% | |
| G | 3472 | 1.5% | |
| F | 3448 | 1.5% | |
| K | 3433 | 1.5% | |
| Y | 3017 | 1.3% | |
| U | 2561 | 1.1% | |
| J | 1651 | 0.7% | |
| X | 442 | 0.2% |
Most frequent Control characters
| Value | Count | Frequency (%) | |
| 22304 | 100.0% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| , | 22304 | 46.4% | |
| . | 22304 | 46.4% | |
| / | 3449 | 7.2% |
Most frequent Open Punctuation characters
| Value | Count | Frequency (%) | |
| ( | 11152 | 100.0% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 22083 | 100.0% |
Most frequent Close Punctuation characters
| Value | Count | Frequency (%) | |
| ) | 11152 | 100.0% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| e | 1806 | 13.4% | |
| t | 1351 | 10.1% | |
| r | 1337 | 9.9% | |
| a | 1337 | 9.9% | |
| n | 910 | 6.8% | |
| l | 910 | 6.8% | |
| o | 896 | 6.7% | |
| s | 896 | 6.7% | |
| y | 882 | 6.6% | |
| i | 455 | 3.4% | |
| d | 455 | 3.4% | |
| k | 441 | 3.3% | |
| w | 441 | 3.3% | |
| x | 441 | 3.3% | |
| b | 441 | 3.3% | |
| u | 441 | 3.3% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 620035 | 71.3% | |
| Latin | 249128 | 28.7% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 1 | 71631 | 11.6% | |
| 2 | 66683 | 10.8% | |
| 59965 | 9.7% | ||
| 0 | 52575 | 8.5% | |
| 4 | 48960 | 7.9% | |
| 3 | 44021 | 7.1% | |
| 7 | 35984 | 5.8% | |
| 5 | 34800 | 5.6% | |
| 6 | 34341 | 5.5% | |
| 8 | 30265 | 4.9% | |
| 9 | 26062 | 4.2% | |
| 22304 | 3.6% | ||
| , | 22304 | 3.6% | |
| . | 22304 | 3.6% | |
| - | 22083 | 3.6% | |
| ( | 11152 | 1.8% | |
| ) | 11152 | 1.8% | |
| / | 3449 | 0.6% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| A | 34108 | 13.7% | |
| N | 20367 | 8.2% | |
| R | 19030 | 7.6% | |
| E | 18415 | 7.4% | |
| M | 17973 | 7.2% | |
| T | 17060 | 6.8% | |
| O | 15346 | 6.2% | |
| S | 14614 | 5.9% | |
| L | 11431 | 4.6% | |
| I | 11029 | 4.4% | |
| C | 8121 | 3.3% | |
| H | 6916 | 2.8% | |
| W | 5988 | 2.4% | |
| V | 4784 | 1.9% | |
| D | 4749 | 1.9% | |
| B | 3878 | 1.6% | |
| P | 3855 | 1.5% | |
| G | 3472 | 1.4% | |
| F | 3448 | 1.4% | |
| K | 3433 | 1.4% | |
| Y | 3017 | 1.2% | |
| U | 2561 | 1.0% | |
| e | 1806 | 0.7% | |
| J | 1651 | 0.7% | |
| t | 1351 | 0.5% | |
| Other values (15) | 10725 | 4.3% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 869163 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 1 | 71631 | 8.2% | |
| 2 | 66683 | 7.7% | |
| 59965 | 6.9% | ||
| 0 | 52575 | 6.0% | |
| 4 | 48960 | 5.6% | |
| 3 | 44021 | 5.1% | |
| 7 | 35984 | 4.1% | |
| 5 | 34800 | 4.0% | |
| 6 | 34341 | 4.0% | |
| A | 34108 | 3.9% | |
| 8 | 30265 | 3.5% | |
| 9 | 26062 | 3.0% | |
| 22304 | 2.6% | ||
| , | 22304 | 2.6% | |
| . | 22304 | 2.6% | |
| - | 22083 | 2.5% | |
| N | 20367 | 2.3% | |
| R | 19030 | 2.2% | |
| E | 18415 | 2.1% | |
| M | 17973 | 2.1% | |
| T | 17060 | 2.0% | |
| O | 15346 | 1.8% | |
| S | 14614 | 1.7% | |
| L | 11431 | 1.3% | |
| ( | 11152 | 1.3% | |
| Other values (33) | 95385 | 11.0% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 172.6 KiB |
| Alive | |
|---|---|
| Deceased |
| Value | Count | Frequency (%) | |
| Alive | 11083 | 50.2% | |
| Deceased | 11000 | 49.8% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 8 |
|---|---|
| Median length | 5 |
| Mean length | 6.494362179 |
| Min length | 5 |
Most occurring characters
| Value | Count | Frequency (%) | |
| e | 44083 | 30.7% | |
| A | 11083 | 7.7% | |
| l | 11083 | 7.7% | |
| i | 11083 | 7.7% | |
| v | 11083 | 7.7% | |
| D | 11000 | 7.7% | |
| c | 11000 | 7.7% | |
| a | 11000 | 7.7% | |
| s | 11000 | 7.7% | |
| d | 11000 | 7.7% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 121332 | 84.6% | |
| Uppercase Letter | 22083 | 15.4% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| A | 11083 | 50.2% | |
| D | 11000 | 49.8% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| e | 44083 | 36.3% | |
| l | 11083 | 9.1% | |
| i | 11083 | 9.1% | |
| v | 11083 | 9.1% | |
| c | 11000 | 9.1% | |
| a | 11000 | 9.1% | |
| s | 11000 | 9.1% | |
| d | 11000 | 9.1% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 143415 | 100.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| e | 44083 | 30.7% | |
| A | 11083 | 7.7% | |
| l | 11083 | 7.7% | |
| i | 11083 | 7.7% | |
| v | 11083 | 7.7% | |
| D | 11000 | 7.7% | |
| c | 11000 | 7.7% | |
| a | 11000 | 7.7% | |
| s | 11000 | 7.7% | |
| d | 11000 | 7.7% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 143415 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| e | 44083 | 30.7% | |
| A | 11083 | 7.7% | |
| l | 11083 | 7.7% | |
| i | 11083 | 7.7% | |
| v | 11083 | 7.7% | |
| D | 11000 | 7.7% | |
| c | 11000 | 7.7% | |
| a | 11000 | 7.7% | |
| s | 11000 | 7.7% | |
| d | 11000 | 7.7% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2149 |
| Missing (%) | 9.7% |
| Memory size | 172.6 KiB |
| Normal (30-60) | |
|---|---|
| Tachypnea |
| Value | Count | Frequency (%) | |
| Normal (30-60) | 10065 | 45.6% | |
| Tachypnea | 9869 | 44.7% | |
| (Missing) | 2149 | 9.7% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 14 |
|---|---|
| Median length | 9 |
| Mean length | 10.69501426 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| a | 31952 | 13.5% | |
| 0 | 20130 | 8.5% | |
| n | 14167 | 6.0% | |
| N | 10065 | 4.3% | |
| o | 10065 | 4.3% | |
| r | 10065 | 4.3% | |
| m | 10065 | 4.3% | |
| l | 10065 | 4.3% | |
| 10065 | 4.3% | ||
| ( | 10065 | 4.3% | |
| 3 | 10065 | 4.3% | |
| - | 10065 | 4.3% | |
| 6 | 10065 | 4.3% | |
| ) | 10065 | 4.3% | |
| T | 9869 | 4.2% | |
| c | 9869 | 4.2% | |
| h | 9869 | 4.2% | |
| y | 9869 | 4.2% | |
| p | 9869 | 4.2% | |
| e | 9869 | 4.2% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 135724 | 57.5% | |
| Decimal Number | 40260 | 17.0% | |
| Uppercase Letter | 19934 | 8.4% | |
| Space Separator | 10065 | 4.3% | |
| Open Punctuation | 10065 | 4.3% | |
| Dash Punctuation | 10065 | 4.3% | |
| Close Punctuation | 10065 | 4.3% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| N | 10065 | 50.5% | |
| T | 9869 | 49.5% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| a | 31952 | 23.5% | |
| n | 14167 | 10.4% | |
| o | 10065 | 7.4% | |
| r | 10065 | 7.4% | |
| m | 10065 | 7.4% | |
| l | 10065 | 7.4% | |
| c | 9869 | 7.3% | |
| h | 9869 | 7.3% | |
| y | 9869 | 7.3% | |
| p | 9869 | 7.3% | |
| e | 9869 | 7.3% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 10065 | 100.0% |
Most frequent Open Punctuation characters
| Value | Count | Frequency (%) | |
| ( | 10065 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 0 | 20130 | 50.0% | |
| 3 | 10065 | 25.0% | |
| 6 | 10065 | 25.0% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 10065 | 100.0% |
Most frequent Close Punctuation characters
| Value | Count | Frequency (%) | |
| ) | 10065 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 155658 | 65.9% | |
| Common | 80520 | 34.1% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| a | 31952 | 20.5% | |
| n | 14167 | 9.1% | |
| N | 10065 | 6.5% | |
| o | 10065 | 6.5% | |
| r | 10065 | 6.5% | |
| m | 10065 | 6.5% | |
| l | 10065 | 6.5% | |
| T | 9869 | 6.3% | |
| c | 9869 | 6.3% | |
| h | 9869 | 6.3% | |
| y | 9869 | 6.3% | |
| p | 9869 | 6.3% | |
| e | 9869 | 6.3% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 0 | 20130 | 25.0% | |
| 10065 | 12.5% | ||
| ( | 10065 | 12.5% | |
| 3 | 10065 | 12.5% | |
| - | 10065 | 12.5% | |
| 6 | 10065 | 12.5% | |
| ) | 10065 | 12.5% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 236178 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| a | 31952 | 13.5% | |
| 0 | 20130 | 8.5% | |
| n | 14167 | 6.0% | |
| N | 10065 | 4.3% | |
| o | 10065 | 4.3% | |
| r | 10065 | 4.3% | |
| m | 10065 | 4.3% | |
| l | 10065 | 4.3% | |
| 10065 | 4.3% | ||
| ( | 10065 | 4.3% | |
| 3 | 10065 | 4.3% | |
| - | 10065 | 4.3% | |
| 6 | 10065 | 4.3% | |
| ) | 10065 | 4.3% | |
| T | 9869 | 4.2% | |
| c | 9869 | 4.2% | |
| h | 9869 | 4.2% | |
| y | 9869 | 4.2% | |
| p | 9869 | 4.2% | |
| e | 9869 | 4.2% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2113 |
| Missing (%) | 9.6% |
| Memory size | 172.6 KiB |
| Normal | |
|---|---|
| Tachycardia |
| Value | Count | Frequency (%) | |
| Normal | 10187 | 46.1% | |
| Tachycardia | 9783 | 44.3% | |
| (Missing) | 2113 | 9.6% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 11 |
|---|---|
| Median length | 6 |
| Mean length | 7.927998913 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| a | 41649 | 23.8% | |
| r | 19970 | 11.4% | |
| c | 19566 | 11.2% | |
| N | 10187 | 5.8% | |
| o | 10187 | 5.8% | |
| m | 10187 | 5.8% | |
| l | 10187 | 5.8% | |
| T | 9783 | 5.6% | |
| h | 9783 | 5.6% | |
| y | 9783 | 5.6% | |
| d | 9783 | 5.6% | |
| i | 9783 | 5.6% | |
| n | 4226 | 2.4% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 155104 | 88.6% | |
| Uppercase Letter | 19970 | 11.4% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| N | 10187 | 51.0% | |
| T | 9783 | 49.0% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| a | 41649 | 26.9% | |
| r | 19970 | 12.9% | |
| c | 19566 | 12.6% | |
| o | 10187 | 6.6% | |
| m | 10187 | 6.6% | |
| l | 10187 | 6.6% | |
| h | 9783 | 6.3% | |
| y | 9783 | 6.3% | |
| d | 9783 | 6.3% | |
| i | 9783 | 6.3% | |
| n | 4226 | 2.7% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 175074 | 100.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| a | 41649 | 23.8% | |
| r | 19970 | 11.4% | |
| c | 19566 | 11.2% | |
| N | 10187 | 5.8% | |
| o | 10187 | 5.8% | |
| m | 10187 | 5.8% | |
| l | 10187 | 5.8% | |
| T | 9783 | 5.6% | |
| h | 9783 | 5.6% | |
| y | 9783 | 5.6% | |
| d | 9783 | 5.6% | |
| i | 9783 | 5.6% | |
| n | 4226 | 2.4% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 175074 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| a | 41649 | 23.8% | |
| r | 19970 | 11.4% | |
| c | 19566 | 11.2% | |
| N | 10187 | 5.8% | |
| o | 10187 | 5.8% | |
| m | 10187 | 5.8% | |
| l | 10187 | 5.8% | |
| T | 9783 | 5.6% | |
| h | 9783 | 5.6% | |
| y | 9783 | 5.6% | |
| d | 9783 | 5.6% | |
| i | 9783 | 5.6% | |
| n | 4226 | 2.4% |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2127 |
| Missing (%) | 9.6% |
| Memory size | 172.6 KiB |
| 0 | |
|---|---|
| (Missing) |
| Value | Count | Frequency (%) | |
| 0 | 19956 | 90.4% | |
| (Missing) | 2127 | 9.6% |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2152 |
| Missing (%) | 9.7% |
| Memory size | 172.6 KiB |
| 0 | |
|---|---|
| (Missing) |
| Value | Count | Frequency (%) | |
| 0 | 19931 | 90.3% | |
| (Missing) | 2152 | 9.7% |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2147 |
| Missing (%) | 9.7% |
| Memory size | 172.6 KiB |
| 0 | |
|---|---|
| (Missing) |
| Value | Count | Frequency (%) | |
| 0 | 19936 | 90.3% | |
| (Missing) | 2147 | 9.7% |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2140 |
| Missing (%) | 9.7% |
| Memory size | 172.6 KiB |
| 1 | |
|---|---|
| (Missing) |
| Value | Count | Frequency (%) | |
| 1 | 19943 | 90.3% | |
| (Missing) | 2140 | 9.7% |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2170 |
| Missing (%) | 9.8% |
| Memory size | 172.6 KiB |
| 0 | |
|---|---|
| (Missing) |
| Value | Count | Frequency (%) | |
| 0 | 19913 | 90.2% | |
| (Missing) | 2170 | 9.8% |
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2125 |
| Missing (%) | 9.6% |
| Memory size | 172.6 KiB |
| Yes |
|---|
| Value | Count | Frequency (%) | |
| Yes | 19958 | 90.4% | |
| (Missing) | 2125 | 9.6% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| Y | 19958 | 30.1% | |
| e | 19958 | 30.1% | |
| s | 19958 | 30.1% | |
| n | 4250 | 6.4% | |
| a | 2125 | 3.2% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 46291 | 69.9% | |
| Uppercase Letter | 19958 | 30.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| Y | 19958 | 100.0% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| e | 19958 | 43.1% | |
| s | 19958 | 43.1% | |
| n | 4250 | 9.2% | |
| a | 2125 | 4.6% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 66249 | 100.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| Y | 19958 | 30.1% | |
| e | 19958 | 30.1% | |
| s | 19958 | 30.1% | |
| n | 4250 | 6.4% | |
| a | 2125 | 3.2% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 66249 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| Y | 19958 | 30.1% | |
| e | 19958 | 30.1% | |
| s | 19958 | 30.1% | |
| n | 4250 | 6.4% | |
| a | 2125 | 3.2% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2166 |
| Missing (%) | 9.8% |
| Memory size | 172.6 KiB |
| Low | |
|---|---|
| High |
| Value | Count | Frequency (%) | |
| Low | 10040 | 45.5% | |
| High | 9877 | 44.7% | |
| (Missing) | 2166 | 9.8% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 3.447267129 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| L | 10040 | 13.2% | |
| o | 10040 | 13.2% | |
| w | 10040 | 13.2% | |
| H | 9877 | 13.0% | |
| i | 9877 | 13.0% | |
| g | 9877 | 13.0% | |
| h | 9877 | 13.0% | |
| n | 4332 | 5.7% | |
| a | 2166 | 2.8% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 56209 | 73.8% | |
| Uppercase Letter | 19917 | 26.2% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| L | 10040 | 50.4% | |
| H | 9877 | 49.6% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| o | 10040 | 17.9% | |
| w | 10040 | 17.9% | |
| i | 9877 | 17.6% | |
| g | 9877 | 17.6% | |
| h | 9877 | 17.6% | |
| n | 4332 | 7.7% | |
| a | 2166 | 3.9% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 76126 | 100.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| L | 10040 | 13.2% | |
| o | 10040 | 13.2% | |
| w | 10040 | 13.2% | |
| H | 9877 | 13.0% | |
| i | 9877 | 13.0% | |
| g | 9877 | 13.0% | |
| h | 9877 | 13.0% | |
| n | 4332 | 5.7% | |
| a | 2166 | 2.8% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 76126 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| L | 10040 | 13.2% | |
| o | 10040 | 13.2% | |
| w | 10040 | 13.2% | |
| H | 9877 | 13.0% | |
| i | 9877 | 13.0% | |
| g | 9877 | 13.0% | |
| h | 9877 | 13.0% | |
| n | 4332 | 5.7% | |
| a | 2166 | 2.8% |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2173 |
| Missing (%) | 9.8% |
| Memory size | 172.6 KiB |
| Ambiguous | |
|---|---|
| Male | |
| Female |
| Value | Count | Frequency (%) | |
| Ambiguous | 6695 | 30.3% | |
| Male | 6666 | 30.2% | |
| Female | 6549 | 29.7% | |
| (Missing) | 2173 | 9.8% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 9 |
|---|---|
| Median length | 6 |
| Mean length | 6.010596386 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| e | 19764 | 14.9% | |
| a | 15388 | 11.6% | |
| u | 13390 | 10.1% | |
| m | 13244 | 10.0% | |
| l | 13215 | 10.0% | |
| A | 6695 | 5.0% | |
| b | 6695 | 5.0% | |
| i | 6695 | 5.0% | |
| g | 6695 | 5.0% | |
| o | 6695 | 5.0% | |
| s | 6695 | 5.0% | |
| M | 6666 | 5.0% | |
| F | 6549 | 4.9% | |
| n | 4346 | 3.3% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 112822 | 85.0% | |
| Uppercase Letter | 19910 | 15.0% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| e | 19764 | 17.5% | |
| a | 15388 | 13.6% | |
| u | 13390 | 11.9% | |
| m | 13244 | 11.7% | |
| l | 13215 | 11.7% | |
| b | 6695 | 5.9% | |
| i | 6695 | 5.9% | |
| g | 6695 | 5.9% | |
| o | 6695 | 5.9% | |
| s | 6695 | 5.9% | |
| n | 4346 | 3.9% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| A | 6695 | 33.6% | |
| M | 6666 | 33.5% | |
| F | 6549 | 32.9% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 132732 | 100.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| e | 19764 | 14.9% | |
| a | 15388 | 11.6% | |
| u | 13390 | 10.1% | |
| m | 13244 | 10.0% | |
| l | 13215 | 10.0% | |
| A | 6695 | 5.0% | |
| b | 6695 | 5.0% | |
| i | 6695 | 5.0% | |
| g | 6695 | 5.0% | |
| o | 6695 | 5.0% | |
| s | 6695 | 5.0% | |
| M | 6666 | 5.0% | |
| F | 6549 | 4.9% | |
| n | 4346 | 3.3% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 132732 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| e | 19764 | 14.9% | |
| a | 15388 | 11.6% | |
| u | 13390 | 10.1% | |
| m | 13244 | 10.0% | |
| l | 13215 | 10.0% | |
| A | 6695 | 5.0% | |
| b | 6695 | 5.0% | |
| i | 6695 | 5.0% | |
| g | 6695 | 5.0% | |
| o | 6695 | 5.0% | |
| s | 6695 | 5.0% | |
| M | 6666 | 5.0% | |
| F | 6549 | 4.9% | |
| n | 4346 | 3.3% |
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2139 |
| Missing (%) | 9.7% |
| Memory size | 172.6 KiB |
| Yes | |
|---|---|
| No record | |
| Not available | |
| No |
| Value | Count | Frequency (%) | |
| Yes | 5106 | 23.1% | |
| No record | 5008 | 22.7% | |
| Not available | 4986 | 22.6% | |
| No | 4844 | 21.9% | |
| (Missing) | 2139 | 9.7% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 13 |
|---|---|
| Median length | 3 |
| Mean length | 6.399175837 |
| Min length | 2 |
Most occurring characters
| Value | Count | Frequency (%) | |
| o | 19846 | 14.0% | |
| a | 17097 | 12.1% | |
| e | 15100 | 10.7% | |
| N | 14838 | 10.5% | |
| r | 10016 | 7.1% | |
| 9994 | 7.1% | ||
| l | 9972 | 7.1% | |
| Y | 5106 | 3.6% | |
| s | 5106 | 3.6% | |
| c | 5008 | 3.5% | |
| d | 5008 | 3.5% | |
| t | 4986 | 3.5% | |
| v | 4986 | 3.5% | |
| i | 4986 | 3.5% | |
| b | 4986 | 3.5% | |
| n | 4278 | 3.0% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 111375 | 78.8% | |
| Uppercase Letter | 19944 | 14.1% | |
| Space Separator | 9994 | 7.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| o | 19846 | 17.8% | |
| a | 17097 | 15.4% | |
| e | 15100 | 13.6% | |
| r | 10016 | 9.0% | |
| l | 9972 | 9.0% | |
| s | 5106 | 4.6% | |
| c | 5008 | 4.5% | |
| d | 5008 | 4.5% | |
| t | 4986 | 4.5% | |
| v | 4986 | 4.5% | |
| i | 4986 | 4.5% | |
| b | 4986 | 4.5% | |
| n | 4278 | 3.8% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| N | 14838 | 74.4% | |
| Y | 5106 | 25.6% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 9994 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 131319 | 92.9% | |
| Common | 9994 | 7.1% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| o | 19846 | 15.1% | |
| a | 17097 | 13.0% | |
| e | 15100 | 11.5% | |
| N | 14838 | 11.3% | |
| r | 10016 | 7.6% | |
| l | 9972 | 7.6% | |
| Y | 5106 | 3.9% | |
| s | 5106 | 3.9% | |
| c | 5008 | 3.8% | |
| d | 5008 | 3.8% | |
| t | 4986 | 3.8% | |
| v | 4986 | 3.8% | |
| i | 4986 | 3.8% | |
| b | 4986 | 3.8% | |
| n | 4278 | 3.3% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 9994 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 141313 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| o | 19846 | 14.0% | |
| a | 17097 | 12.1% | |
| e | 15100 | 10.7% | |
| N | 14838 | 10.5% | |
| r | 10016 | 7.1% | |
| 9994 | 7.1% | ||
| l | 9972 | 7.1% | |
| Y | 5106 | 3.6% | |
| s | 5106 | 3.6% | |
| c | 5008 | 3.5% | |
| d | 5008 | 3.5% | |
| t | 4986 | 3.5% | |
| v | 4986 | 3.5% | |
| i | 4986 | 3.5% | |
| b | 4986 | 3.5% | |
| n | 4278 | 3.0% |
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1026 |
| Missing (%) | 4.6% |
| Memory size | 172.6 KiB |
| Not applicable | |
|---|---|
| Yes | |
| None | |
| No |
| Value | Count | Frequency (%) | |
| Not applicable | 11083 | 50.2% | |
| Yes | 3383 | 15.3% | |
| None | 3366 | 15.2% | |
| No | 3225 | 14.6% | |
| (Missing) | 1026 | 4.6% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 14 |
|---|---|
| Median length | 14 |
| Mean length | 8.527057012 |
| Min length | 2 |
Most occurring characters
| Value | Count | Frequency (%) | |
| a | 23192 | 12.3% | |
| p | 22166 | 11.8% | |
| l | 22166 | 11.8% | |
| e | 17832 | 9.5% | |
| N | 17674 | 9.4% | |
| o | 17674 | 9.4% | |
| t | 11083 | 5.9% | |
| 11083 | 5.9% | ||
| i | 11083 | 5.9% | |
| c | 11083 | 5.9% | |
| b | 11083 | 5.9% | |
| n | 5418 | 2.9% | |
| Y | 3383 | 1.8% | |
| s | 3383 | 1.8% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 156163 | 82.9% | |
| Uppercase Letter | 21057 | 11.2% | |
| Space Separator | 11083 | 5.9% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| N | 17674 | 83.9% | |
| Y | 3383 | 16.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| a | 23192 | 14.9% | |
| p | 22166 | 14.2% | |
| l | 22166 | 14.2% | |
| e | 17832 | 11.4% | |
| o | 17674 | 11.3% | |
| t | 11083 | 7.1% | |
| i | 11083 | 7.1% | |
| c | 11083 | 7.1% | |
| b | 11083 | 7.1% | |
| n | 5418 | 3.5% | |
| s | 3383 | 2.2% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 11083 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 177220 | 94.1% | |
| Common | 11083 | 5.9% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| a | 23192 | 13.1% | |
| p | 22166 | 12.5% | |
| l | 22166 | 12.5% | |
| e | 17832 | 10.1% | |
| N | 17674 | 10.0% | |
| o | 17674 | 10.0% | |
| t | 11083 | 6.3% | |
| i | 11083 | 6.3% | |
| c | 11083 | 6.3% | |
| b | 11083 | 6.3% | |
| n | 5418 | 3.1% | |
| Y | 3383 | 1.9% | |
| s | 3383 | 1.9% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 11083 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 188303 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| a | 23192 | 12.3% | |
| p | 22166 | 11.8% | |
| l | 22166 | 11.8% | |
| e | 17832 | 9.5% | |
| N | 17674 | 9.4% | |
| o | 17674 | 9.4% | |
| t | 11083 | 5.9% | |
| 11083 | 5.9% | ||
| i | 11083 | 5.9% | |
| c | 11083 | 5.9% | |
| b | 11083 | 5.9% | |
| n | 5418 | 2.9% | |
| Y | 3383 | 1.8% | |
| s | 3383 | 1.8% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2124 |
| Missing (%) | 9.6% |
| Memory size | 172.6 KiB |
| Institute | |
|---|---|
| Home |
| Value | Count | Frequency (%) | |
| Institute | 10073 | 45.6% | |
| Home | 9886 | 44.8% | |
| (Missing) | 2124 | 9.6% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 9 |
|---|---|
| Median length | 4 |
| Mean length | 6.184531087 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| t | 30219 | 22.1% | |
| e | 19959 | 14.6% | |
| n | 14321 | 10.5% | |
| I | 10073 | 7.4% | |
| s | 10073 | 7.4% | |
| i | 10073 | 7.4% | |
| u | 10073 | 7.4% | |
| H | 9886 | 7.2% | |
| o | 9886 | 7.2% | |
| m | 9886 | 7.2% | |
| a | 2124 | 1.6% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 116614 | 85.4% | |
| Uppercase Letter | 19959 | 14.6% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| I | 10073 | 50.5% | |
| H | 9886 | 49.5% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| t | 30219 | 25.9% | |
| e | 19959 | 17.1% | |
| n | 14321 | 12.3% | |
| s | 10073 | 8.6% | |
| i | 10073 | 8.6% | |
| u | 10073 | 8.6% | |
| o | 9886 | 8.5% | |
| m | 9886 | 8.5% | |
| a | 2124 | 1.8% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 136573 | 100.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| t | 30219 | 22.1% | |
| e | 19959 | 14.6% | |
| n | 14321 | 10.5% | |
| I | 10073 | 7.4% | |
| s | 10073 | 7.4% | |
| i | 10073 | 7.4% | |
| u | 10073 | 7.4% | |
| H | 9886 | 7.2% | |
| o | 9886 | 7.2% | |
| m | 9886 | 7.2% | |
| a | 2124 | 1.6% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 136573 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| t | 30219 | 22.1% | |
| e | 19959 | 14.6% | |
| n | 14321 | 10.5% | |
| I | 10073 | 7.4% | |
| s | 10073 | 7.4% | |
| i | 10073 | 7.4% | |
| u | 10073 | 7.4% | |
| H | 9886 | 7.2% | |
| o | 9886 | 7.2% | |
| m | 9886 | 7.2% | |
| a | 2124 | 1.6% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2117 |
| Missing (%) | 9.6% |
| Memory size | 172.6 KiB |
| Yes | |
|---|---|
| No | |
| (Missing) |
| Value | Count | Frequency (%) | |
| Yes | 10087 | 45.7% | |
| No | 9879 | 44.7% | |
| (Missing) | 2117 | 9.6% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2152 |
| Missing (%) | 9.7% |
| Memory size | 172.6 KiB |
| No | |
|---|---|
| Yes | |
| (Missing) |
| Value | Count | Frequency (%) | |
| No | 10012 | 45.3% | |
| Yes | 9919 | 44.9% | |
| (Missing) | 2152 | 9.7% |
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2153 |
| Missing (%) | 9.7% |
| Memory size | 172.6 KiB |
| Not applicable | |
|---|---|
| No | |
| Yes | |
| - |
| Value | Count | Frequency (%) | |
| Not applicable | 5029 | 22.8% | |
| No | 5005 | 22.7% | |
| Yes | 4980 | 22.6% | |
| - | 4916 | 22.3% | |
| (Missing) | 2153 | 9.7% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 14 |
|---|---|
| Median length | 3 |
| Mean length | 4.83317484 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| a | 12211 | 11.4% | |
| p | 10058 | 9.4% | |
| l | 10058 | 9.4% | |
| N | 10034 | 9.4% | |
| o | 10034 | 9.4% | |
| e | 10009 | 9.4% | |
| t | 5029 | 4.7% | |
| 5029 | 4.7% | ||
| i | 5029 | 4.7% | |
| c | 5029 | 4.7% | |
| b | 5029 | 4.7% | |
| Y | 4980 | 4.7% | |
| s | 4980 | 4.7% | |
| - | 4916 | 4.6% | |
| n | 4306 | 4.0% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 81772 | 76.6% | |
| Uppercase Letter | 15014 | 14.1% | |
| Space Separator | 5029 | 4.7% | |
| Dash Punctuation | 4916 | 4.6% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| N | 10034 | 66.8% | |
| Y | 4980 | 33.2% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| a | 12211 | 14.9% | |
| p | 10058 | 12.3% | |
| l | 10058 | 12.3% | |
| o | 10034 | 12.3% | |
| e | 10009 | 12.2% | |
| t | 5029 | 6.2% | |
| i | 5029 | 6.2% | |
| c | 5029 | 6.2% | |
| b | 5029 | 6.2% | |
| s | 4980 | 6.1% | |
| n | 4306 | 5.3% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 5029 | 100.0% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 4916 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 96786 | 90.7% | |
| Common | 9945 | 9.3% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| a | 12211 | 12.6% | |
| p | 10058 | 10.4% | |
| l | 10058 | 10.4% | |
| N | 10034 | 10.4% | |
| o | 10034 | 10.4% | |
| e | 10009 | 10.3% | |
| t | 5029 | 5.2% | |
| i | 5029 | 5.2% | |
| c | 5029 | 5.2% | |
| b | 5029 | 5.2% | |
| Y | 4980 | 5.1% | |
| s | 4980 | 5.1% | |
| n | 4306 | 4.4% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 5029 | 50.6% | ||
| - | 4916 | 49.4% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 106731 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| a | 12211 | 11.4% | |
| p | 10058 | 9.4% | |
| l | 10058 | 9.4% | |
| N | 10034 | 9.4% | |
| o | 10034 | 9.4% | |
| e | 10009 | 9.4% | |
| t | 5029 | 4.7% | |
| 5029 | 4.7% | ||
| i | 5029 | 4.7% | |
| c | 5029 | 4.7% | |
| b | 5029 | 4.7% | |
| Y | 4980 | 4.7% | |
| s | 4980 | 4.7% | |
| - | 4916 | 4.6% | |
| n | 4306 | 4.0% |
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2195 |
| Missing (%) | 9.9% |
| Memory size | 172.6 KiB |
| - | |
|---|---|
| No | |
| Yes | |
| Not applicable |
| Value | Count | Frequency (%) | |
| - | 5042 | 22.8% | |
| No | 5033 | 22.8% | |
| Yes | 4975 | 22.5% | |
| Not applicable | 4838 | 21.9% | |
| (Missing) | 2195 | 9.9% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 14 |
|---|---|
| Median length | 3 |
| Mean length | 4.725354345 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| a | 11871 | 11.4% | |
| N | 9871 | 9.5% | |
| o | 9871 | 9.5% | |
| e | 9813 | 9.4% | |
| p | 9676 | 9.3% | |
| l | 9676 | 9.3% | |
| - | 5042 | 4.8% | |
| Y | 4975 | 4.8% | |
| s | 4975 | 4.8% | |
| t | 4838 | 4.6% | |
| 4838 | 4.6% | ||
| i | 4838 | 4.6% | |
| c | 4838 | 4.6% | |
| b | 4838 | 4.6% | |
| n | 4390 | 4.2% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 79624 | 76.3% | |
| Uppercase Letter | 14846 | 14.2% | |
| Dash Punctuation | 5042 | 4.8% | |
| Space Separator | 4838 | 4.6% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| N | 9871 | 66.5% | |
| Y | 4975 | 33.5% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| a | 11871 | 14.9% | |
| o | 9871 | 12.4% | |
| e | 9813 | 12.3% | |
| p | 9676 | 12.2% | |
| l | 9676 | 12.2% | |
| s | 4975 | 6.2% | |
| t | 4838 | 6.1% | |
| i | 4838 | 6.1% | |
| c | 4838 | 6.1% | |
| b | 4838 | 6.1% | |
| n | 4390 | 5.5% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 4838 | 100.0% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 5042 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 94470 | 90.5% | |
| Common | 9880 | 9.5% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| a | 11871 | 12.6% | |
| N | 9871 | 10.4% | |
| o | 9871 | 10.4% | |
| e | 9813 | 10.4% | |
| p | 9676 | 10.2% | |
| l | 9676 | 10.2% | |
| Y | 4975 | 5.3% | |
| s | 4975 | 5.3% | |
| t | 4838 | 5.1% | |
| i | 4838 | 5.1% | |
| c | 4838 | 5.1% | |
| b | 4838 | 5.1% | |
| n | 4390 | 4.6% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| - | 5042 | 51.0% | |
| 4838 | 49.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 104350 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| a | 11871 | 11.4% | |
| N | 9871 | 9.5% | |
| o | 9871 | 9.5% | |
| e | 9813 | 9.4% | |
| p | 9676 | 9.3% | |
| l | 9676 | 9.3% | |
| - | 5042 | 4.8% | |
| Y | 4975 | 4.8% | |
| s | 4975 | 4.8% | |
| t | 4838 | 4.6% | |
| 4838 | 4.6% | ||
| i | 4838 | 4.6% | |
| c | 4838 | 4.6% | |
| b | 4838 | 4.6% | |
| n | 4390 | 4.2% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2122 |
| Missing (%) | 9.6% |
| Memory size | 172.6 KiB |
| Yes | |
|---|---|
| No | |
| (Missing) |
| Value | Count | Frequency (%) | |
| Yes | 10012 | 45.3% | |
| No | 9949 | 45.1% | |
| (Missing) | 2122 | 9.6% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2172 |
| Missing (%) | 9.8% |
| Memory size | 172.6 KiB |
| Yes | |
|---|---|
| No | |
| (Missing) |
| Value | Count | Frequency (%) | |
| Yes | 10082 | 45.7% | |
| No | 9829 | 44.5% | |
| (Missing) | 2172 | 9.8% |
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2162 |
| Missing (%) | 9.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.003062095 |
|---|---|
| Minimum | 0 |
| Maximum | 4 |
| Zeros | 3964 |
| Zeros (%) | 18.0% |
| Memory size | 172.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 4 |
| Maximum | 4 |
| Range | 4 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.411918808 |
|---|---|
| Coefficient of variation (CV) | 0.7048801987 |
| Kurtosis | -1.290355235 |
| Mean | 2.003062095 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.001030905292 |
| Sum | 39903 |
| Variance | 1.993514719 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=5)
| Value | Count | Frequency (%) | |
| 2 | 4117 | 18.6% | |
| 4 | 4005 | 18.1% | |
| 0 | 3964 | 18.0% | |
| 1 | 3928 | 17.8% | |
| 3 | 3907 | 17.7% | |
| (Missing) | 2162 | 9.8% |
| Value | Count | Frequency (%) | |
| 0 | 3964 | 18.0% | |
| 1 | 3928 | 17.8% | |
| 2 | 4117 | 18.6% | |
| 3 | 3907 | 17.7% | |
| 4 | 4005 | 18.1% |
| Value | Count | Frequency (%) | |
| 4 | 4005 | 18.1% | |
| 3 | 3907 | 17.7% | |
| 2 | 4117 | 18.6% | |
| 1 | 3928 | 17.8% | |
| 0 | 3964 | 18.0% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2154 |
| Missing (%) | 9.8% |
| Memory size | 172.6 KiB |
| Singular | |
|---|---|
| Multiple |
| Value | Count | Frequency (%) | |
| Singular | 9977 | 45.2% | |
| Multiple | 9952 | 45.1% | |
| (Missing) | 2154 | 9.8% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 7.512294525 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| l | 29881 | 18.0% | |
| u | 19929 | 12.0% | |
| i | 19929 | 12.0% | |
| n | 14285 | 8.6% | |
| a | 12131 | 7.3% | |
| S | 9977 | 6.0% | |
| g | 9977 | 6.0% | |
| r | 9977 | 6.0% | |
| M | 9952 | 6.0% | |
| t | 9952 | 6.0% | |
| p | 9952 | 6.0% | |
| e | 9952 | 6.0% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 145965 | 88.0% | |
| Uppercase Letter | 19929 | 12.0% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| l | 29881 | 20.5% | |
| u | 19929 | 13.7% | |
| i | 19929 | 13.7% | |
| n | 14285 | 9.8% | |
| a | 12131 | 8.3% | |
| g | 9977 | 6.8% | |
| r | 9977 | 6.8% | |
| t | 9952 | 6.8% | |
| p | 9952 | 6.8% | |
| e | 9952 | 6.8% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| S | 9977 | 50.1% | |
| M | 9952 | 49.9% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 165894 | 100.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| l | 29881 | 18.0% | |
| u | 19929 | 12.0% | |
| i | 19929 | 12.0% | |
| n | 14285 | 8.6% | |
| a | 12131 | 7.3% | |
| S | 9977 | 6.0% | |
| g | 9977 | 6.0% | |
| r | 9977 | 6.0% | |
| M | 9952 | 6.0% | |
| t | 9952 | 6.0% | |
| p | 9952 | 6.0% | |
| e | 9952 | 6.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 165894 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| l | 29881 | 18.0% | |
| u | 19929 | 12.0% | |
| i | 19929 | 12.0% | |
| n | 14285 | 8.6% | |
| a | 12131 | 7.3% | |
| S | 9977 | 6.0% | |
| g | 9977 | 6.0% | |
| r | 9977 | 6.0% | |
| M | 9952 | 6.0% | |
| t | 9952 | 6.0% | |
| p | 9952 | 6.0% | |
| e | 9952 | 6.0% |
| Distinct | 17277 |
|---|---|
| Distinct (%) | 86.7% |
| Missing | 2148 |
| Missing (%) | 9.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.486223987 |
|---|---|
| Minimum | 3 |
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 172.6 KiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 5.424703056 |
| median | 7.477132167 |
| Q3 | 9.526151897 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 9 |
| Interquartile range (IQR) | 4.10144884 |
Descriptive statistics
| Standard deviation | 2.653392652 |
|---|---|
| Coefficient of variation (CV) | 0.3544367169 |
| Kurtosis | -0.974504467 |
| Mean | 7.486223987 |
| Median Absolute Deviation (MAD) | 2.051562736 |
| Skewness | 0.006638939182 |
| Sum | 149237.8752 |
| Variance | 7.040492567 |
| Monotocity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 3 | 1333 | 6.0% | |
| 12 | 1327 | 6.0% | |
| 5.011905094 | 1 | < 0.1% | |
| 5.357475139 | 1 | < 0.1% | |
| 3.972519439 | 1 | < 0.1% | |
| 4.629433587 | 1 | < 0.1% | |
| 6.724889256 | 1 | < 0.1% | |
| 5.436622368 | 1 | < 0.1% | |
| 7.911525971 | 1 | < 0.1% | |
| 9.701924776 | 1 | < 0.1% | |
| 7.122433 | 1 | < 0.1% | |
| 7.274736667 | 1 | < 0.1% | |
| 10.82488782 | 1 | < 0.1% | |
| 7.289513059 | 1 | < 0.1% | |
| 9.556362234 | 1 | < 0.1% | |
| 7.149517962 | 1 | < 0.1% | |
| 9.506282272 | 1 | < 0.1% | |
| 9.591498748 | 1 | < 0.1% | |
| 5.003273321 | 1 | < 0.1% | |
| 7.526451681 | 1 | < 0.1% | |
| 7.574335794 | 1 | < 0.1% | |
| 7.496753182 | 1 | < 0.1% | |
| 7.462932928 | 1 | < 0.1% | |
| 4.561422107 | 1 | < 0.1% | |
| 7.680802907 | 1 | < 0.1% | |
| Other values (17252) | 17252 | 78.1% | |
| (Missing) | 2148 | 9.7% |
| Value | Count | Frequency (%) | |
| 3 | 1333 | 6.0% | |
| 3.000736131 | 1 | < 0.1% | |
| 3.001456988 | 1 | < 0.1% | |
| 3.003665854 | 1 | < 0.1% | |
| 3.003856548 | 1 | < 0.1% | |
| 3.00559547 | 1 | < 0.1% | |
| 3.005621539 | 1 | < 0.1% | |
| 3.005967525 | 1 | < 0.1% | |
| 3.006314905 | 1 | < 0.1% | |
| 3.008312762 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 12 | 1327 | 6.0% | |
| 11.99985747 | 1 | < 0.1% | |
| 11.99965298 | 1 | < 0.1% | |
| 11.99929293 | 1 | < 0.1% | |
| 11.99670683 | 1 | < 0.1% | |
| 11.99667763 | 1 | < 0.1% | |
| 11.99610031 | 1 | < 0.1% | |
| 11.99546766 | 1 | < 0.1% | |
| 11.99534647 | 1 | < 0.1% | |
| 11.99532318 | 1 | < 0.1% |
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2145 |
| Missing (%) | 9.7% |
| Memory size | 172.6 KiB |
| slightly abnormal | |
|---|---|
| normal | |
| inconclusive | |
| abnormal |
| Value | Count | Frequency (%) | |
| slightly abnormal | 5128 | 23.2% | |
| normal | 4954 | 22.4% | |
| inconclusive | 4952 | 22.4% | |
| abnormal | 4904 | 22.2% | |
| (Missing) | 2145 | 9.7% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 17 |
|---|---|
| Median length | 8 |
| Mean length | 10.05257438 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| l | 30194 | 13.6% | |
| n | 29180 | 13.1% | |
| a | 27163 | 12.2% | |
| o | 19938 | 9.0% | |
| i | 15032 | 6.8% | |
| r | 14986 | 6.8% | |
| m | 14986 | 6.8% | |
| s | 10080 | 4.5% | |
| b | 10032 | 4.5% | |
| c | 9904 | 4.5% | |
| g | 5128 | 2.3% | |
| h | 5128 | 2.3% | |
| t | 5128 | 2.3% | |
| y | 5128 | 2.3% | |
| 5128 | 2.3% | ||
| u | 4952 | 2.2% | |
| v | 4952 | 2.2% | |
| e | 4952 | 2.2% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 216863 | 97.7% | |
| Space Separator | 5128 | 2.3% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| l | 30194 | 13.9% | |
| n | 29180 | 13.5% | |
| a | 27163 | 12.5% | |
| o | 19938 | 9.2% | |
| i | 15032 | 6.9% | |
| r | 14986 | 6.9% | |
| m | 14986 | 6.9% | |
| s | 10080 | 4.6% | |
| b | 10032 | 4.6% | |
| c | 9904 | 4.6% | |
| g | 5128 | 2.4% | |
| h | 5128 | 2.4% | |
| t | 5128 | 2.4% | |
| y | 5128 | 2.4% | |
| u | 4952 | 2.3% | |
| v | 4952 | 2.3% | |
| e | 4952 | 2.3% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 5128 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 216863 | 97.7% | |
| Common | 5128 | 2.3% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| l | 30194 | 13.9% | |
| n | 29180 | 13.5% | |
| a | 27163 | 12.5% | |
| o | 19938 | 9.2% | |
| i | 15032 | 6.9% | |
| r | 14986 | 6.9% | |
| m | 14986 | 6.9% | |
| s | 10080 | 4.6% | |
| b | 10032 | 4.6% | |
| c | 9904 | 4.6% | |
| g | 5128 | 2.4% | |
| h | 5128 | 2.4% | |
| t | 5128 | 2.4% | |
| y | 5128 | 2.4% | |
| u | 4952 | 2.3% | |
| v | 4952 | 2.3% | |
| e | 4952 | 2.3% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 5128 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 221991 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| l | 30194 | 13.6% | |
| n | 29180 | 13.1% | |
| a | 27163 | 12.2% | |
| o | 19938 | 9.0% | |
| i | 15032 | 6.8% | |
| r | 14986 | 6.8% | |
| m | 14986 | 6.8% | |
| s | 10080 | 4.5% | |
| b | 10032 | 4.5% | |
| c | 9904 | 4.5% | |
| g | 5128 | 2.3% | |
| h | 5128 | 2.3% | |
| t | 5128 | 2.3% | |
| y | 5128 | 2.3% | |
| 5128 | 2.3% | ||
| u | 4952 | 2.2% | |
| v | 4952 | 2.2% | |
| e | 4952 | 2.2% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2155 |
| Missing (%) | 9.8% |
| Memory size | 172.6 KiB |
| 1 | |
|---|---|
| 0 | |
| (Missing) |
| Value | Count | Frequency (%) | |
| 1 | 11807 | 53.5% | |
| 0 | 8121 | 36.8% | |
| (Missing) | 2155 | 9.8% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2222 |
| Missing (%) | 10.1% |
| Memory size | 172.6 KiB |
| 1 | |
|---|---|
| 0 | |
| (Missing) |
| Value | Count | Frequency (%) | |
| 1 | 10961 | 49.6% | |
| 0 | 8900 | 40.3% | |
| (Missing) | 2222 | 10.1% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2101 |
| Missing (%) | 9.5% |
| Memory size | 172.6 KiB |
| 1 | |
|---|---|
| 0 | |
| (Missing) |
| Value | Count | Frequency (%) | |
| 1 | 10715 | 48.5% | |
| 0 | 9267 | 42.0% | |
| (Missing) | 2101 | 9.5% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2113 |
| Missing (%) | 9.6% |
| Memory size | 172.6 KiB |
| 0 | |
|---|---|
| 1 | |
| (Missing) |
| Value | Count | Frequency (%) | |
| 0 | 10030 | 45.4% | |
| 1 | 9940 | 45.0% | |
| (Missing) | 2113 | 9.6% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2153 |
| Missing (%) | 9.7% |
| Memory size | 172.6 KiB |
| 0 | |
|---|---|
| 1 | |
| (Missing) |
| Value | Count | Frequency (%) | |
| 0 | 10724 | 48.6% | |
| 1 | 9206 | 41.7% | |
| (Missing) | 2153 | 9.7% |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2146 |
| Missing (%) | 9.7% |
| Memory size | 172.6 KiB |
| Mitochondrial genetic inheritance disorders | |
|---|---|
| Single-gene inheritance diseases | |
| Multifactorial genetic inheritance disorders |
| Value | Count | Frequency (%) | |
| Mitochondrial genetic inheritance disorders | 10202 | 46.2% | |
| Single-gene inheritance diseases | 7664 | 34.7% | |
| Multifactorial genetic inheritance disorders | 2071 | 9.4% | |
| (Missing) | 2146 | 9.7% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 44 |
|---|---|
| Median length | 43 |
| Mean length | 35.38903229 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| e | 115013 | 14.7% | |
| i | 104294 | 13.3% | |
| n | 81969 | 10.5% | |
| r | 56756 | 7.3% | |
| 52147 | 6.7% | ||
| s | 47538 | 6.1% | |
| t | 46554 | 6.0% | |
| c | 44483 | 5.7% | |
| a | 44091 | 5.6% | |
| d | 42412 | 5.4% | |
| o | 34748 | 4.4% | |
| h | 30139 | 3.9% | |
| g | 27601 | 3.5% | |
| l | 22008 | 2.8% | |
| M | 12273 | 1.6% | |
| S | 7664 | 1.0% | |
| - | 7664 | 1.0% | |
| u | 2071 | 0.3% | |
| f | 2071 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 701748 | 89.8% | |
| Space Separator | 52147 | 6.7% | |
| Uppercase Letter | 19937 | 2.6% | |
| Dash Punctuation | 7664 | 1.0% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| M | 12273 | 61.6% | |
| S | 7664 | 38.4% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| e | 115013 | 16.4% | |
| i | 104294 | 14.9% | |
| n | 81969 | 11.7% | |
| r | 56756 | 8.1% | |
| s | 47538 | 6.8% | |
| t | 46554 | 6.6% | |
| c | 44483 | 6.3% | |
| a | 44091 | 6.3% | |
| d | 42412 | 6.0% | |
| o | 34748 | 5.0% | |
| h | 30139 | 4.3% | |
| g | 27601 | 3.9% | |
| l | 22008 | 3.1% | |
| u | 2071 | 0.3% | |
| f | 2071 | 0.3% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 52147 | 100.0% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 7664 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 721685 | 92.3% | |
| Common | 59811 | 7.7% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| e | 115013 | 15.9% | |
| i | 104294 | 14.5% | |
| n | 81969 | 11.4% | |
| r | 56756 | 7.9% | |
| s | 47538 | 6.6% | |
| t | 46554 | 6.5% | |
| c | 44483 | 6.2% | |
| a | 44091 | 6.1% | |
| d | 42412 | 5.9% | |
| o | 34748 | 4.8% | |
| h | 30139 | 4.2% | |
| g | 27601 | 3.8% | |
| l | 22008 | 3.0% | |
| M | 12273 | 1.7% | |
| S | 7664 | 1.1% | |
| u | 2071 | 0.3% | |
| f | 2071 | 0.3% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 52147 | 87.2% | ||
| - | 7664 | 12.8% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 781496 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| e | 115013 | 14.7% | |
| i | 104294 | 13.3% | |
| n | 81969 | 10.5% | |
| r | 56756 | 7.3% | |
| 52147 | 6.7% | ||
| s | 47538 | 6.1% | |
| t | 46554 | 6.0% | |
| c | 44483 | 5.7% | |
| a | 44091 | 5.6% | |
| d | 42412 | 5.4% | |
| o | 34748 | 4.4% | |
| h | 30139 | 3.9% | |
| g | 27601 | 3.5% | |
| l | 22008 | 2.8% | |
| M | 12273 | 1.6% | |
| S | 7664 | 1.0% | |
| - | 7664 | 1.0% | |
| u | 2071 | 0.3% | |
| f | 2071 | 0.3% |
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2168 |
| Missing (%) | 9.8% |
| Memory size | 172.6 KiB |
| Leigh syndrome | |
|---|---|
| Mitochondrial myopathy | |
| Cystic fibrosis | |
| Tay-Sachs | |
| Diabetes | |
| Other values (4) |
| Value | Count | Frequency (%) | |
| Leigh syndrome | 5160 | 23.4% | |
| Mitochondrial myopathy | 4405 | 19.9% | |
| Cystic fibrosis | 3448 | 15.6% | |
| Tay-Sachs | 2833 | 12.8% | |
| Diabetes | 1817 | 8.2% | |
| Hemochromatosis | 1355 | 6.1% | |
| Leber's hereditary optic neuropathy | 648 | 2.9% | |
| Alzheimer's | 152 | 0.7% | |
| Cancer | 97 | 0.4% | |
| (Missing) | 2168 | 9.8% |
Frequencies of value counts
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 35 |
|---|---|
| Median length | 14 |
| Mean length | 14.15867409 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| i | 28934 | 9.3% | |
| o | 27184 | 8.7% | |
| s | 23664 | 7.6% | |
| y | 21547 | 6.9% | |
| a | 21209 | 6.8% | |
| h | 19606 | 6.3% | |
| e | 18950 | 6.1% | |
| t | 17374 | 5.6% | |
| r | 17209 | 5.5% | |
| 14957 | 4.8% | ||
| n | 14646 | 4.7% | |
| c | 12786 | 4.1% | |
| m | 12427 | 4.0% | |
| d | 10213 | 3.3% | |
| b | 5913 | 1.9% | |
| L | 5808 | 1.9% | |
| p | 5701 | 1.8% | |
| g | 5160 | 1.7% | |
| l | 4557 | 1.5% | |
| M | 4405 | 1.4% | |
| C | 3545 | 1.1% | |
| f | 3448 | 1.1% | |
| T | 2833 | 0.9% | |
| - | 2833 | 0.9% | |
| S | 2833 | 0.9% | |
| Other values (6) | 4924 | 1.6% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 271328 | 86.8% | |
| Uppercase Letter | 22748 | 7.3% | |
| Space Separator | 14957 | 4.8% | |
| Dash Punctuation | 2833 | 0.9% | |
| Other Punctuation | 800 | 0.3% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| L | 5808 | 25.5% | |
| M | 4405 | 19.4% | |
| C | 3545 | 15.6% | |
| T | 2833 | 12.5% | |
| S | 2833 | 12.5% | |
| D | 1817 | 8.0% | |
| H | 1355 | 6.0% | |
| A | 152 | 0.7% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| i | 28934 | 10.7% | |
| o | 27184 | 10.0% | |
| s | 23664 | 8.7% | |
| y | 21547 | 7.9% | |
| a | 21209 | 7.8% | |
| h | 19606 | 7.2% | |
| e | 18950 | 7.0% | |
| t | 17374 | 6.4% | |
| r | 17209 | 6.3% | |
| n | 14646 | 5.4% | |
| c | 12786 | 4.7% | |
| m | 12427 | 4.6% | |
| d | 10213 | 3.8% | |
| b | 5913 | 2.2% | |
| p | 5701 | 2.1% | |
| g | 5160 | 1.9% | |
| l | 4557 | 1.7% | |
| f | 3448 | 1.3% | |
| u | 648 | 0.2% | |
| z | 152 | 0.1% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| ' | 800 | 100.0% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 14957 | 100.0% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 2833 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 294076 | 94.1% | |
| Common | 18590 | 5.9% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| i | 28934 | 9.8% | |
| o | 27184 | 9.2% | |
| s | 23664 | 8.0% | |
| y | 21547 | 7.3% | |
| a | 21209 | 7.2% | |
| h | 19606 | 6.7% | |
| e | 18950 | 6.4% | |
| t | 17374 | 5.9% | |
| r | 17209 | 5.9% | |
| n | 14646 | 5.0% | |
| c | 12786 | 4.3% | |
| m | 12427 | 4.2% | |
| d | 10213 | 3.5% | |
| b | 5913 | 2.0% | |
| L | 5808 | 2.0% | |
| p | 5701 | 1.9% | |
| g | 5160 | 1.8% | |
| l | 4557 | 1.5% | |
| M | 4405 | 1.5% | |
| C | 3545 | 1.2% | |
| f | 3448 | 1.2% | |
| T | 2833 | 1.0% | |
| S | 2833 | 1.0% | |
| D | 1817 | 0.6% | |
| H | 1355 | 0.5% | |
| Other values (3) | 952 | 0.3% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 14957 | 80.5% | ||
| - | 2833 | 15.2% | |
| ' | 800 | 4.3% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 312666 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| i | 28934 | 9.3% | |
| o | 27184 | 8.7% | |
| s | 23664 | 7.6% | |
| y | 21547 | 6.9% | |
| a | 21209 | 6.8% | |
| h | 19606 | 6.3% | |
| e | 18950 | 6.1% | |
| t | 17374 | 5.6% | |
| r | 17209 | 5.5% | |
| 14957 | 4.8% | ||
| n | 14646 | 4.7% | |
| c | 12786 | 4.1% | |
| m | 12427 | 4.0% | |
| d | 10213 | 3.3% | |
| b | 5913 | 1.9% | |
| L | 5808 | 1.9% | |
| p | 5701 | 1.8% | |
| g | 5160 | 1.7% | |
| l | 4557 | 1.5% | |
| M | 4405 | 1.4% | |
| C | 3545 | 1.1% | |
| f | 3448 | 1.1% | |
| T | 2833 | 0.9% | |
| - | 2833 | 0.9% | |
| S | 2833 | 0.9% | |
| Other values (6) | 4924 | 1.6% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| Patient_Id | Patient_Age | Genes_in_mothers_side | Inherited_from_father | Maternal_gene | Paternal_gene | Blood_cell_count_(mcL) | Patient_First_Name | Family_Name | Fathers_name | Mothers_age | Fathers_age | Institute_Name | Location_of_Institute | Status | Respiratory_Rate_(breaths/min) | Heart_Rate_(rates/min | Test_1 | Test_2 | Test_3 | Test_4 | Test_5 | Parental_consent | Follow-up | Gender | Birth_asphyxia | Autopsy_shows_birth_defect_(if_applicable) | Place_of_birth | Folic_acid_details_(peri-conceptional) | H/O_serious_maternal_illness | H/O_radiation_exposure_(x-ray) | H/O_substance_abuse | Assisted_conception_IVF/ART | History_of_anomalies_in_previous_pregnancies | No._of_previous_abortion | Birth_defects | White_Blood_cell_count_(thousand_per_microliter) | Blood_test_result | Symptom_1 | Symptom_2 | Symptom_3 | Symptom_4 | Symptom_5 | Genetic_Disorder | Disorder_Subclass | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | PID0x6418 | 2.0 | Yes | No | Yes | No | 4.760603 | Richard | NaN | Larre | NaN | NaN | Boston Specialty & Rehabilitation Hospital | 55 FRUIT ST\nCENTRAL, MA 02114\n(42.36247485742686, -71.06924724545246) | Alive | Normal (30-60) | Normal | 0.0 | NaN | NaN | 1.0 | 0.0 | Yes | High | NaN | NaN | Not applicable | Institute | No | NaN | No | No | No | Yes | NaN | NaN | 9.857562 | NaN | 1.0 | 1.0 | 1.0 | 1.0 | 1.0 | Mitochondrial genetic inheritance disorders | Leber's hereditary optic neuropathy |
| 1 | PID0x25d5 | 4.0 | Yes | Yes | No | No | 4.910669 | Mike | NaN | Brycen | NaN | 23.0 | St. Margaret's Hospital For Women | 1515 COMMONWEALTH AV\nALLSTON/BRIGHTON, MA 02135\n(42.34665771451756, -71.14136122385321) | Deceased | Tachypnea | Normal | NaN | 0.0 | 0.0 | 1.0 | 0.0 | Yes | High | NaN | No | None | NaN | Yes | Yes | Not applicable | Not applicable | No | Yes | NaN | Multiple | 5.522560 | normal | 1.0 | NaN | 1.0 | 1.0 | 0.0 | NaN | Cystic fibrosis |
| 2 | PID0x4a82 | 6.0 | Yes | No | No | No | 4.893297 | Kimberly | NaN | Nashon | 41.0 | 22.0 | NaN | - | Alive | Normal (30-60) | Tachycardia | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | Yes | Low | NaN | No record | Not applicable | NaN | Yes | No | Yes | NaN | Yes | Yes | 4.0 | Singular | NaN | normal | 0.0 | 1.0 | 1.0 | 1.0 | 1.0 | Multifactorial genetic inheritance disorders | Diabetes |
| 3 | PID0x4ac8 | 12.0 | Yes | No | Yes | No | 4.705280 | Jeffery | Hoelscher | Aayaan | 21.0 | NaN | NaN | 55 FRUIT ST\nCENTRAL, MA 02114\n(42.36247485742686, -71.06924724545246) | Deceased | Tachypnea | Normal | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | Yes | High | Male | Not available | No | Institute | No | Yes | - | Not applicable | NaN | Yes | 1.0 | Singular | 7.919321 | inconclusive | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | Mitochondrial genetic inheritance disorders | Leigh syndrome |
| 4 | PID0x1bf7 | 11.0 | Yes | No | NaN | Yes | 4.720703 | Johanna | Stutzman | Suave | 32.0 | NaN | Carney Hospital | 300 LONGWOOD AV\nFENWAY/KENMORE, MA 02115\n(42.337592548462226, -71.10472284437952) | Alive | Tachypnea | Tachycardia | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | NaN | Low | Male | Not available | Not applicable | Institute | No | Yes | - | Not applicable | Yes | No | 4.0 | Multiple | 4.098210 | NaN | 0.0 | 0.0 | 0.0 | 0.0 | NaN | Multifactorial genetic inheritance disorders | Cancer |
| 5 | PID0x44fe | 14.0 | Yes | No | Yes | No | 5.103188 | Richard | NaN | Coleston | NaN | NaN | Massachusetts General Hospital | 55 FRUIT ST\nCENTRAL, MA 02114\n(42.36247485742686, -71.06924724545246) | Deceased | NaN | Normal | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | Yes | Low | Female | Not available | None | Institute | No | No | No | No | NaN | No | 0.0 | Multiple | 10.272230 | normal | 1.0 | 0.0 | 0.0 | 1.0 | 0.0 | Single-gene inheritance diseases | Cystic fibrosis |
| 6 | PID0x28de | 3.0 | Yes | No | Yes | Yes | 4.901080 | Mary | NaN | Aydun | NaN | 63.0 | Not applicable | - | Alive | Normal (30-60) | NaN | NaN | 0.0 | 0.0 | 1.0 | 0.0 | NaN | Low | Male | No record | Not applicable | Home | NaN | Yes | No | Not applicable | Yes | No | 3.0 | Multiple | 6.825974 | normal | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | Single-gene inheritance diseases | Tay-Sachs |
| 7 | PID0x4f8f | 3.0 | No | No | Yes | Yes | 4.964816 | Emma | Bryant | Keng | 40.0 | NaN | Not applicable | - | Alive | Tachypnea | Normal | 0.0 | 0.0 | NaN | 1.0 | 0.0 | Yes | Low | NaN | No record | Not applicable | Home | Yes | Yes | No | - | No | Yes | 1.0 | Singular | 9.836352 | inconclusive | 0.0 | 0.0 | 1.0 | NaN | 0.0 | Single-gene inheritance diseases | Tay-Sachs |
| 8 | PID0x8ce3 | 11.0 | No | No | Yes | No | 5.209058 | Willie | Camacho | Tr | 45.0 | 44.0 | Lemuel Shattuck Hospital | 125 NASHUA ST\nCENTRAL, MA 02114\n(42.36764789068138, -71.06564730220646) | Alive | Tachypnea | Tachycardia | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | Yes | Low | Male | Yes | Not applicable | Institute | Yes | Yes | No | No | No | Yes | 0.0 | Multiple | 6.669552 | slightly abnormal | 1.0 | 1.0 | 1.0 | 0.0 | 1.0 | Mitochondrial genetic inheritance disorders | Leigh syndrome |
| 9 | PID0x8660 | 4.0 | No | Yes | Yes | Yes | 4.752272 | John | Sandoval | Gregori | 44.0 | 42.0 | Shriners Burns Institute | 1200 Centre St\nRoslindale, MA 02131\n(42.29738386053219, -71.13150465441208) | Alive | Tachypnea | Tachycardia | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | Yes | Low | Male | No | Not applicable | Institute | Yes | No | No | No | Yes | Yes | 1.0 | Multiple | 6.397702 | abnormal | 0.0 | 0.0 | 1.0 | 1.0 | 1.0 | Multifactorial genetic inheritance disorders | Diabetes |
Last rows
| Patient_Id | Patient_Age | Genes_in_mothers_side | Inherited_from_father | Maternal_gene | Paternal_gene | Blood_cell_count_(mcL) | Patient_First_Name | Family_Name | Fathers_name | Mothers_age | Fathers_age | Institute_Name | Location_of_Institute | Status | Respiratory_Rate_(breaths/min) | Heart_Rate_(rates/min | Test_1 | Test_2 | Test_3 | Test_4 | Test_5 | Parental_consent | Follow-up | Gender | Birth_asphyxia | Autopsy_shows_birth_defect_(if_applicable) | Place_of_birth | Folic_acid_details_(peri-conceptional) | H/O_serious_maternal_illness | H/O_radiation_exposure_(x-ray) | H/O_substance_abuse | Assisted_conception_IVF/ART | History_of_anomalies_in_previous_pregnancies | No._of_previous_abortion | Birth_defects | White_Blood_cell_count_(thousand_per_microliter) | Blood_test_result | Symptom_1 | Symptom_2 | Symptom_3 | Symptom_4 | Symptom_5 | Genetic_Disorder | Disorder_Subclass | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 22073 | PID0xbd | 13.0 | Yes | Yes | No | Yes | 4.874635 | Rosa | NaN | Donovin | 44.0 | 62.0 | Not applicable | - | Alive | Tachypnea | Tachycardia | 0.0 | NaN | 0.0 | 1.0 | 0.0 | Yes | NaN | NaN | No record | Not applicable | Home | No | NaN | No | No | Yes | No | 1.0 | NaN | NaN | normal | 0.0 | 0.0 | 0.0 | 1.0 | 1.0 | NaN | Leigh syndrome |
| 22074 | PID0x6a0a | 4.0 | No | No | NaN | No | 4.789307 | Randy | Howell | Javontay | 35.0 | 51.0 | Beth Israel Deaconess Medical Center West Cam | 88 EAST NEWTON ST\nSOUTH END, MA 02118\n(42.3371094801158, -71.07139912234962) | Alive | Tachypnea | Normal | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | Yes | Low | Male | Yes | Not applicable | Institute | No | No | - | No | No | No | 3.0 | Multiple | NaN | normal | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | Single-gene inheritance diseases | Hemochromatosis |
| 22075 | PID0x5f56 | 10.0 | No | No | Yes | Yes | 4.643860 | Edward | Thomas | Eoghan | 49.0 | NaN | Not applicable | - | Deceased | NaN | Normal | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | NaN | Low | NaN | No | Yes | Home | No | NaN | NaN | Not applicable | Yes | Yes | 2.0 | Multiple | 9.581455 | abnormal | 1.0 | 0.0 | 0.0 | 0.0 | NaN | Mitochondrial genetic inheritance disorders | Mitochondrial myopathy |
| 22076 | PID0x26b4 | 0.0 | Yes | No | Yes | No | 4.931758 | Samuel | NaN | Kiril | NaN | 50.0 | Lemuel Shattuck Hospital | 88 EAST NEWTON ST\nSOUTH END, MA 02118\n(42.3371094801158, -71.07139912234962) | Alive | Normal (30-60) | Tachycardia | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | Yes | Low | Female | No record | Not applicable | Institute | No | No | Not applicable | No | Yes | Yes | 1.0 | Singular | 11.649052 | abnormal | 1.0 | 1.0 | 0.0 | 1.0 | 0.0 | Mitochondrial genetic inheritance disorders | Leigh syndrome |
| 22077 | PID0x3656 | 9.0 | No | Yes | Yes | Yes | 5.012599 | Edward | Hurst | Quientin | 47.0 | NaN | Not applicable | - | Deceased | NaN | Normal | 0.0 | NaN | 0.0 | 1.0 | 0.0 | Yes | NaN | Ambiguous | No record | Yes | Home | Yes | No | No | Not applicable | Yes | Yes | NaN | NaN | 12.000000 | slightly abnormal | NaN | 1.0 | 0.0 | 0.0 | 0.0 | Mitochondrial genetic inheritance disorders | Leigh syndrome |
| 22078 | PID0x5598 | 4.0 | Yes | Yes | Yes | No | 5.258298 | Lynn | NaN | Alhassane | 35.0 | 64.0 | Franciscan Children's Hospital | 1153 CENTRE ST\nJAMAICA PLAIN, MA 02130\n(42.30021828265608, -71.12789683059322) | Deceased | Normal (30-60) | Tachycardia | NaN | 0.0 | NaN | 1.0 | 0.0 | Yes | High | Female | No | No | Institute | NaN | No | Not applicable | No | Yes | No | 3.0 | Multiple | 6.584811 | inconclusive | 0.0 | 0.0 | 1.0 | 0.0 | 0.0 | Mitochondrial genetic inheritance disorders | Leigh syndrome |
| 22079 | PID0x19cb | 8.0 | No | Yes | No | Yes | 4.974220 | Matthew | Farley | Dartanion | NaN | 56.0 | Faulkner Hospital | 170 MORTON ST\nROSLINDALE, MA 02130\n(42.30025000839615, -71.10737910445549) | Alive | Normal (30-60) | Normal | NaN | 0.0 | NaN | 1.0 | NaN | NaN | High | Ambiguous | No | Not applicable | Institute | Yes | Yes | No | - | Yes | No | 2.0 | Multiple | 7.041556 | inconclusive | 1.0 | 1.0 | 1.0 | 1.0 | 0.0 | Multifactorial genetic inheritance disorders | Diabetes |
| 22080 | PID0x3c4f | 8.0 | Yes | No | Yes | No | 5.186470 | John | NaN | Cavani | 35.0 | 51.0 | Not applicable | - | Deceased | Tachypnea | Normal | 0.0 | 0.0 | 0.0 | 1.0 | NaN | Yes | High | Male | No | None | Home | No | No | NaN | No | No | No | 2.0 | Singular | 7.715464 | normal | 0.0 | 0.0 | 0.0 | 1.0 | NaN | Mitochondrial genetic inheritance disorders | Mitochondrial myopathy |
| 22081 | PID0x13a | 7.0 | Yes | No | Yes | Yes | 4.858543 | Sharon | NaN | Bomer | 19.0 | NaN | Not applicable | - | Alive | Tachypnea | Tachycardia | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | Yes | High | Male | No record | Not applicable | Home | Yes | Yes | - | Yes | Yes | No | 1.0 | Multiple | 8.437670 | abnormal | 1.0 | 1.0 | 1.0 | 0.0 | 0.0 | NaN | Leigh syndrome |
| 22082 | PID0x9332 | 11.0 | Yes | No | No | No | 4.738067 | Andrew | Mose | Eban | 32.0 | 62.0 | Hebrew Rehabilitation Center | 300 LONGWOOD AV\nFENWAY/KENMORE, MA 02115\n(42.337592548462226, -71.10472284437952) | Deceased | Normal (30-60) | Normal | 0.0 | 0.0 | 0.0 | 1.0 | 0.0 | Yes | High | Female | Yes | None | Institute | Yes | Yes | Not applicable | No | Yes | Yes | 4.0 | Singular | 11.188371 | normal | 1.0 | 0.0 | 1.0 | 1.0 | 1.0 | Multifactorial genetic inheritance disorders | Diabetes |